It’s quite similar to another situation known as data incest
Comment on Dreams come true
runner_g@lemmy.blahaj.zone 1 year agoSomeone’s probably already coined the term, but I’m going to call it LLM inbreeding.
Benn@lemm.ee 1 year ago
chicken@lemmy.dbzer0.com 1 year ago
The real term is synthetic data
itslilith@lemmy.blahaj.zone 1 year ago
but it amounts to about the same
chicken@lemmy.dbzer0.com 1 year ago
thesporkeffect@lemmy.world 1 year ago
Soylent AI? Auto-infocannibalism
Naz@sh.itjust.works 1 year ago
I suggested this term in academic circles, as a joke.
I also suggested hallucinations ~3-6 years ago only to find out it was ALSO suggested in the 1970s.
Inbreeding, lol
anzo@programming.dev 1 year ago
There was some research article applying this 70s computer science concept to LLMs. It was published in Nature and hit major news outlets. Basically they further trained GPT on its output for a couple generations, until the model degraded terribly. Sounded obvious to me, but seeing it happen on the www is painful nonetheless…