Comment on How Much Do LLMs Hallucinate in Document Q&A Scenarios? A 172-Billion-Token Study Across Temperatures, Context Lengths, and Hardware Platforms [TLDR: 25%]

FauxLiving@lemmy.world 8 hours ago

At 32K, the best model (GLM 4.5) fabricates 1.19% of answers

Not bad; I don’t know many people who are 98.81% accurate in their statements.

source
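For what it's worth, the 98.81% figure is just the complement of the quoted 1.19% fabrication rate; a quick sketch of that arithmetic:

```python
# Complement of the quoted fabrication rate: 100% - 1.19% = 98.81%
fabrication_rate = 1.19  # percent, quoted for GLM 4.5 at 32K context
accuracy = 100 - fabrication_rate
print(f"{accuracy:.2f}%")  # prints "98.81%"
```

(Strictly speaking, a fabrication rate is not the same as an accuracy rate, since an answer can be non-fabricated but still wrong, but the complement works for the joke.)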