Comment on How Much Do LLMs Hallucinate in Document Q&A Scenarios? A 172-Billion-Token Study Across Temperatures, Context Lengths, and Hardware Platforms [TLDR: 25%]

HubertManne@piefed.social 1 week ago

See, if people don't use LLMs at all, they can fall victim to thinking the models are better than they are. Using one a bit for something unimportant that you are knowledgeable enough about lets you see the flaws, and it does not take much time to see them.
