Comment on AI’s Memorization Crisis | Large language models don’t “learn”—they copy. And that could change everything for the tech industry.

TheBlackLounge@lemmy.zip 2 days ago

As the article explains, they also store the information itself.

They learn a compression algorithm that fits the data, then use it to store that data. The first part isn't new; AI and compression theory go back decades. What's new and surprising is that you can get the original work back out of attention transformers. Even in traditionally overfit models that isn't a given. And attention transformers shine at generality, so it isn't obvious that they should do this. But every model tested does it, so maybe it's even necessary?
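A toy sketch of the "model as storage" point (everything here is hypothetical, not how transformers actually work): if a model is fit so tightly that every token has a unique successor, its learned parameters double as a lossless copy of the training data, and greedy generation regurgitates it verbatim.

```python
# Hypothetical illustration: a one-sentence bigram "model".
# When every word has exactly one successor (a total overfit),
# the parameter table IS the data, and sampling reproduces it.

text = "the quick brown fox jumps over a lazy dog".split()

# "Training": record each word's successor.
successor = {}
for prev, nxt in zip(text, text[1:]):
    successor[prev] = nxt

# "Generation": start at the first token and follow the learned map.
out = [text[0]]
while out[-1] in successor:
    out.append(successor[out[-1]])

print(" ".join(out))  # prints the training sentence verbatim
```

Real transformers are nothing like this tiny chain, of course; the surprise the article describes is that extraction still works at their scale and generality.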

Storing data isn't a theoretical failure; some very useful AI algorithms do it by design. It's a legal and ethical failure, because OpenAI et al. have claimed from the beginning that this isn't happening, and it also provides proof of the pirated works the models were trained on.
