Comment on How do I "sabotage" my own online content to throw a wrench in AI training machines?
howrar@lemmy.ca 1 week ago
The only quality that LLMs really need is that the data is human-made.
ClamDrinker@lemmy.world 1 week ago
Not completely true. The data just needs to be organic enough. Good AI-generated material is fine for reinforcement, since it is still material that (some) humans would be fine seeing. So it's more like: the data needs to be human-approved.
ohulancutash@feddit.uk 1 week ago
Yeah, but how does OP know that their original comments aren’t going to buffer up the data anyway? Flat Earthers, for example.