Reddit is actually pretty decent for training LLMs. Funnily enough, an AI finetuned on 4chan does better in intelegence benchmarks.
Comment on Reddit Signs AI Content Licensing Deal Ahead of IPO
BlueEther@no.lastname.nz 8 months ago
I’ve been on Reddit, and I don’t know that I would like to use an LLM trained on much of the content there (excluding the tech/DIY space).
muntedcrocodile@lemmy.world 8 months ago
Gonkulator@lemm.ee 8 months ago
“Intelegence”. Oh the irony.
lvxferre@mander.xyz 8 months ago
“Finetuned”, “Intelegence”. Oh the irony.
Focus on what is being said, not how it is said. The comment is silly but its usage of non-standard spelling has jack shit to do with it.
Gonkulator@lemm.ee 8 months ago
No thanks. I’m going to go ahead and focus on what I choose. But thanks for your input.
muntedcrocodile@lemmy.world 8 months ago
Your unwarranted fixation on spelling in an online forum blatantly exposes your glaring dearth of insight beyond superficiality, a trait that most likely mirrors the shallowness dwelling within you.
pineapplepizza@lemm.ee 8 months ago
Source? Or BS?
muntedcrocodile@lemmy.world 8 months ago
Sorry, truth benchmarks, not intelligence: www.youtube.com/watch?v=efPrtcLdcdM
FiskFisk33@startrek.website 8 months ago
GPT-3/4 are already trained on Reddit data. Not Reddit data exclusively, but there’s a lot of it in there.