Reddit is actually pretty decent for training LLMs. Funnily enough, an AI finetuned on 4chan does better in intelegence benchmarks.
Comment on Reddit Signs AI Content Licensing Deal Ahead of IPO
BlueEther@no.lastname.nz 1 year ago
I’ve been on Reddit; I don’t know that I would want to use an LLM trained on much of the content there (excluding the tech/DIY space).
muntedcrocodile@lemmy.world 1 year ago
Gonkulator@lemm.ee 1 year ago
“Intelegence”. Oh the irony.
lvxferre@mander.xyz 1 year ago
“Finetuned”, “Intelegence”. Oh the irony.
Focus on what is being said, not how it is said. The comment is silly but its usage of non-standard spelling has jack shit to do with it.
Gonkulator@lemm.ee 1 year ago
No thanks. I’m going to go ahead and focus on what I choose. But thanks for your input.
muntedcrocodile@lemmy.world 1 year ago
Your unwarranted fixation on spelling in an online forum blatantly exposes your glaring dearth of insight beyond superficiality, a trait that most likely mirrors the shallowness dwelling within you.
pineapplepizza@lemm.ee 1 year ago
Source? Or BS?
muntedcrocodile@lemmy.world 1 year ago
Sorry, truth benchmarks, not intelligence: www.youtube.com/watch?v=efPrtcLdcdM
FiskFisk33@startrek.website 1 year ago
GPT-3/4 are already trained on Reddit data. Not Reddit data exclusively, but there’s a lot of it in there.