Basically a deer with a human face. Despite probably being some sort of magical nature spirit, his interests are primarily in technology and politics and science fiction.

Spent many years on Reddit before joining the Threadiverse as well.

  • 0 Posts
  • 138 Comments
Joined 1 year ago
cake
Cake day: March 3rd, 2024

help-circle




  • Interestingly, I’m not seeing your quoted content when I look at this article. I see a three-paragraph-long article that says in a nutshell “people don’t visit source sites as much now that AI summarizes the contents for them.” (Ironic that I am manually summarizing it like that).

    Perhaps it’s some kind of paywall blocking me from seeing the rest? I don’t see any popup telling me that, but I’ve got a lot of adblockers that might be stopping that from appearing. I’m not going to disable adblockers just to see whether this is paywalled, given how incredibly intrusive and annoying ads are these days.

    Gee, I wonder why people prefer AI.









  • I don’t see what distinction you’re trying to draw here. It previously had trouble generating full glasses of wine, they made some changes, now it can. As a result, AIs are capable of generating an image of a full wine glass.

    This is just another goalpost that’s been blown past, like the “AI will never be able to draw hands correctly” thing that was so popular back in the day. Now AIs are quite good at drawing hands, and so new “but they can’t do X!” Standards have been invented. I see no fundamental reason why any of those standards won’t ultimately be surpassed.


  • The judge writes that the Authors told him that LLMs memorized the content and could recite it. He then said “for purposes of argument I’ll assume that’s true,” and even despite that he went ahead and ruled that LLM training does not violate copyright.

    It was perhaps a bit daring of Anthropic not to contest what the Authors claimed in that case, but as it turns out the result is an even stronger ruling. The judge gave the Authors every benefit of the doubt and still found that they had no case when it came to training.




  • Again, you should read the ruling. The judge explicitly addresses this. The Authors claim that this is how LLMs work, and the judge says “okay, let’s assume that their claim is true.”

    Fourth, each fully trained LLM itself retained “compressed” copies of the works it had trained upon, or so Authors contend and this order takes for granted.

    Even on that basis he still finds that it’s not violating copyright to train an LLM.

    And I don’t think the Authors’ claim would hold up if challenged, for that matter. Anthropic chose not to challenge it because it didn’t make a difference to their case, but in actuality an LLM doesn’t store the training data verbatim within itself. It’s physically impossible to compress text that much.


  • Is it this?

    First, Authors argue that using works to train Claude’s underlying LLMs was like using works to train any person to read and write, so Authors should be able to exclude Anthropic from this use (Opp. 16).

    That’s the judge addressing an argument that the Authors made. If anyone made a “false equivalence” here it’s the plaintiffs, the judge is simply saying “okay, let’s assume their claim is true.” As is the usual case for a preliminary judgment like this.