Comment on Largest study of its kind shows AI assistants misrepresent news content 45% of the time – regardless of language or territory

jordanlund@lemmy.world 2 days ago

I wish they had broken it out by AI. The article states:

“Gemini performed worst with significant issues in 76% of responses, more than double the other assistants, largely due to its poor sourcing performance.”

But I don’t see that anywhere in the linked PDF of the “full results”.

This sort of study should also be redone from time to time to track how the results change across AI versions.
