Comment on Asking ChatGPT to Repeat Words ‘Forever’ Is Now a Terms of Service Violation

TWeaK@lemm.ee 11 months ago

But the fact is that the LLM was able to spit out its training data. That means the material isn’t just copied into the training dataset, allegedly under fair use as research; it is also copied into the LLM itself, which is part of an active commercial product. Sure, the LLM might break it down and store the components separately, but if it can reassemble them and spit out the original copyrighted work, how is that different from a photocopier breaking down the image scanned from a piece of paper and then reassembling it into instructions for its printer?
