They always were.
Only now they’ve agreed to pay Reddit for it. This is what their third party lockdown was really all about.
They’re helping themselves to your Lemmy comments for free, as that’s just how it’s designed. If you post anything publicly anywhere, it’s getting slurped up by a bot somewhere.
myliltoehurts@lemm.ee 5 months ago
So they filled reddit with bot generated content, and now they’re selling back the same stuff likely to the company who generated most of it.
At what point can we call an AI inbred?
orca@orcas.enjoying.yachts 5 months ago
This is actually a thing. It’s called “Model Collapse”. You can read about it here.
FaceDeer@fedia.io 5 months ago
"Model collapse" can be easily avoided by keeping old human data with new synthetic data in the training set. The old archives of Reddit content from before there was AI are still around.
noodlejetski@lemm.ee 5 months ago
I prefer “Habsburg AI”.
restingboredface@sh.itjust.works 5 months ago
I wonder if Open AI or any of the other firms have thought to put in any kind of stipulations about monitoring and moderating reddit content to reduce ai generated posts and reduce risk of model collapse.
Anybody who’s looked at reddit in the past 2 years especially has seen the impact of ai pretty clearly. If I was running open ai I wouldn’t want that crap contaminating my models.