There’s a big difference between borrowing inspiration and just using entire paragraphs of text or images wholesale. If GRRM uses entire paragraphs of JK Rowling with just the names changed and uses the same cover with a few different colors you have the same fight. LLM can do the first, but also does the second.
The “in the style of” is a different issue that’s being debated, as style isn’t protected by law. But apparently if you ask in the style of, the LLM can get lazy and produces parts of the (copyrighted) source material instead of something original.
wewbull@feddit.uk 10 months ago
This story is about a non-fiction work.
What is the purpose of a non-fiction work? It’s to give the reader further knowledge on a subject.
Why does an LLM manufacturer train their model on a non-fiction work? To be able to act as a substitute source of the knowledge.
End result is that
So, not only have they stolen their work, they’ve stolen their income and reputation.
bassomitron@lemmy.world 10 months ago
If you’re using an LLM as any form of authoritative source-and literally any LLM specifically warns NOT to do that–then you’re going to have a bad time. No one is using them to learn in any serious capacity. Ideally, the AI should absolutely be citing its sources, and if someone is able to figure out how to do that reliably, they’ll be made quite rich, I’d imagine.
Stoneykins@mander.xyz 10 months ago
For someone who claimed to not be a fan of OpenAI, you sure do know all the fan arguments against regulation for AI.
bassomitron@lemmy.world 10 months ago