That would be true if they used material that way paywalled. But the vast majority of the training information used is publicly available. There’s plenty of freely available books and information that you only require an internet connection for to access, and learn from.
Comment on The Irony of 'You Wouldn't Download a Car' Making a Comeback in AI Debates
arin@lemmy.world 2 months ago
Kids pay for books, openAI should also pay for the material access used for training.
ClamDrinker@lemmy.world 2 months ago
FatCat@lemmy.world 2 months ago
OpenAI like other AI companies keep their data sources confidential. But there are services and commercial databases for books that people understand are commonly used in the AI industry.
EddoWagt@feddit.nl 2 months ago
“We trained on absolutely everything, but we won’t tell them that because it will get us in a lot of trouble”