AI already trains on Wikipedia.
Comment on Wikipedia has banned AI-generated text, with two exceptions
errer@lemmy.world 22 hours agoWikipedia probably wants to sell access to LLMs to train. It’s only valuable if Wikipedia remains a high-quality, slop-free source.
I think even AI zealots think there should be silos of content to train from that are fully human generated. Training slop on slop makes the slop even worse.
SuspciousCarrot78@lemmy.world 21 hours ago
MountingSuspicion@reddthat.com 20 hours ago
This was only done because the editors pushed to minimize AI involvement. There’s a comment here already mentioning that: lemmy.world/comment/22826863
Grimy@lemmy.world 21 hours ago
Sell licenses of what? It’s already all in the creative commons iirc.
Zagorath@quokk.au 17 hours ago
The content is CC licensed, but they are trying to block AI scraping because it overloads their servers. They have a paid API that uses a lot less compute for both Wikipedia and the AI, as well as being a revenue source for Wikipedia.