Comment

Comment on ChatGPT spends 'tens of millions of dollars' on people saying 'please' and 'thank you', but Sam Altman says it's worth it

<- View Parent

Dran_Arcana@lemmy.world ⁨1⁩ ⁨year⁩ ago

Anecdotally, I use it a lot and I feel like my responses are better when I’m polite. I have a couple of theories as to why.

More tokens in the context window of your question, and a clear separator between ideas in a conversation make it easier for the inference tokenizer to recognize disparate ideas.
Higher quality datasets contain american boomer/millennial notions of “politeness” and when responses are structured in kind, they’re more likely to contain tokens from those higher quality datasets.

I haven’t mathematically proven any of this within the llama.cpp tokenizer, but I strongly suspect that I could at least prove a correlation between polite token input and dataset representation output tokens

source

Sort:hotnew top

tdawg@lemmy.world ⁨1⁩ ⁨year⁩ ago
Honestly they were better until recently. GPT (at least) has gotten really good at de-escalation and providing (mostly) factual responses when you get irate

source
spankmonkey@lemmy.world ⁨1⁩ ⁨year⁩ ago
It FEEEEEEEEEEEELS better is what the authors said too. Both articles were completely worthless dreck about how they felt about the responses.

source
- CileTheSane@lemmy.ca ⁨1⁩ ⁨year⁩ ago
  And you FEEEEEEEEL like it doesn’t matter. What’s the difference?
  
  source
- Dran_Arcana@lemmy.world ⁨1⁩ ⁨year⁩ ago
  Yes they were, so I’m offering you an actual theory as to why this may actually be true, yet difficult to “prove”.
  
  Smoking was bad for your health long before anyone sat down and took the time to prove it. Autoregressive LLM tokenizer are a very new field of computer science and it’s going to take a while for the community to collectively understand everything we’re currently doing by trial and error.
  
  source
  - homesweethomeMrL@lemmy.world ⁨1⁩ ⁨year⁩ ago
    And yet doctors saw the tar in the lungs and knew immediately.
    
    source
  - spankmonkey@lemmy.world ⁨1⁩ ⁨year⁩ ago
    Smoking was known to be bad for your health long before anyone did studies because it was easily correlated with coughing and other breathing issues and early death. The evidence was obvious and apparent.
    
    source
    Dark_Arc@social.packetloss.gg ⁨1⁩ ⁨year⁩ ago
    🤦‍♂️
    
    source