Have a look at the clock faces they’re using to benchmark and it’ll make more sense.
Comment on ClockBench: Even the best AI models can't reliably read the clock
MHLoppy@fedia.io 3 days ago
The human level accuracy is less than 90%!?
CouldntCareBear@sh.itjust.works 3 days ago
MHLoppy@fedia.io 3 days ago
Really wish they published the whole dataset. They don't specify on the page or in the paper what the full set was like, and the GitHub repo only has one of the easy-to-read ones. If >=10% of the set is comprised of clock faces designed not to be readable then fair enough.
panda_abyss@lemmy.ca 3 days ago
Some of those don’t have tick marks. I hate clocks like that, they’re difficult to read.
I’m surprised it’s near 90, a whole generation has grown up with digital clocks everywhere.