Using Reddit’s popular ChangeMyView community as a source of baseline data, OpenAI had previously found that 2022’s ChatGPT-3.5 was significantly less persuasive than random humans, ranking in just the 38th percentile on this measure. But that performance jumped to the 77th percentile with September’s release of the o1-mini reasoning model and up to percentiles in the high 80s for the full-fledged o1 model.
So are you smarter than a Redditor?
rtxn@lemmy.world 1 week ago
That bar is so low it’s practically a tripping hazard in hell.
will_a113@lemmy.ml 1 week ago
😂