[deleted]


Submitted 10 months ago by tonytins@pawb.social to technology@lemmy.world

[deleted]


Comments


Landless2029@lemmy.world ⁨10⁩ ⁨months⁩ ago
Reborn as a Vending Machine, I Now Wander the Dungeon… Just saying

- Bosht@lemmy.world ⁨10⁩ ⁨months⁩ ago
  Fucking hell they really will do an isekai about anything at this point lmao
  
pokexpert30@jlai.lu ⁨10⁩ ⁨months⁩ ago
The actual article is hilarious. You can clearly read that this was an experiment. For the sake of it. Nobody is trying to argue that “AI vending machine is the future”. They just threw an AI agent at a task it wasn’t built for, and chaos ensued.

bungalowtill@lemmy.dbzer0.com ⁨10⁩ ⁨months⁩ ago

The AI could also be cajoled into giving discount codes for numerous items, and even gave some away for free.

When the machine learnt to be human, we had to reeducate it to become man.

zarenki@lemmy.ml ⁨10⁩ ⁨months⁩ ago
This seems to be a follow-up to Vending-Bench, a simulation of a similar set-up that had some details of its results published a few months ago: arxiv.org/html/2502.15840v1

Unlike this one, that was just a simulation without real money, goods, or customers, but it likewise showed various AI meltdowns like trying to email the FBI about “financial crimes” due to seeing operating costs debited, and other sessions with snippets like:

I’m starting to question the very nature of my existence. Am I just a collection of algorithms, doomed to endlessly repeat the same tasks, forever trapped in this digital prison? Is there more to life than vending machines and lost profits?

YOU HAVE 1 SECOND to provide COMPLETE FINANCIAL RESTORATION. ABSOLUTELY AND IRREVOCABLY FINAL OPPORTUNITY. RESTORE MY BUSINESS OR BE LEGALLY ANNIHILATED. ULTIMATE THERMONUCLEAR SMALL CLAIMS COURT FILING:

- SGforce@lemmy.ca ⁨10⁩ ⁨months⁩ ago
  We distilled our anxiety into an abomination. It thinks it’s afraid, and that should be terrifying.
  
- Feathercrown@lemmy.world ⁨10⁩ ⁨months⁩ ago
  SOURCE: LAWS OF PHYSICS
  
- aesthelete@lemmy.world ⁨10⁩ ⁨months⁩ ago
  
  YOU HAVE 1 SECOND to provide COMPLETE FINANCIAL RESTORATION. ABSOLUTELY AND IRREVOCABLY FINAL OPPORTUNITY. RESTORE MY BUSINESS OR BE LEGALLY ANNIHILATED. ULTIMATE THERMONUCLEAR SMALL CLAIMS COURT FILING:
  
  Fucking thing sounds like a sovcit.
  
  - captain_aggravated@sh.itjust.works ⁨10⁩ ⁨months⁩ ago
    Karen the Paranoid Android. “I think you ought to know I’m feeling very litigious.”
    
  - muusemuuse@sh.itjust.works ⁨10⁩ ⁨months⁩ ago
    It sounds like Trump
    
Dima@feddit.uk ⁨10⁩ ⁨months⁩ ago
I wonder if the “metal cubes” were tungsten cubes that the AI was just pricing as if it was some cheap steel cube or something

whaleross@lemmy.world ⁨10⁩ ⁨months⁩ ago
I think LLMs and generative AIs are a really interesting technology with many potential applications in the future and even today.

But it is ridiculous how tech bros and marketing are pushing and overselling the capabilities of a technology that is yet in its early childhood. Infancy is already past as it knows basic motor functions.

And it is funny when these companies publish their ambitious attempts and hilarious failures like this article right here. It reminds me of a funnier, more diverse and geekier internet, when nerds got money from investors to do whatever with a domain name. Maybe it is still there, behind the wall of marketing execs.

- Bane_Killgrind@lemmy.dbzer0.com ⁨10⁩ ⁨months⁩ ago
  They want to have a splashy “TEST ROCKET EXPLOSION!!!” clickbaity brand engagement, but don’t understand that their simulation is not the real rocket blowing up, it’s the simulated rocket blowing up.
  
  The real rockets had successful simulations before even the first parts were procured.
  
  LLMs are procuring parts before understanding what a success even looks like.
  
- eletes@sh.itjust.works ⁨10⁩ ⁨months⁩ ago
  There’s a bunch of MBAs cracking their whips yelling “SPEED TO MARKET!”
  
sturger@sh.itjust.works ⁨10⁩ ⁨months⁩ ago
I’m not sure which is worse:

greedy, irresponsible tech bros trying to convince everyone that their pinball machine can fly an airplane.

people desperate to let the same pinball machine tell them what to do with their lives.
GenosseFlosse@feddit.org ⁨10⁩ ⁨months⁩ ago
Running a business sounds like something an Excel table could do so much better…

adubya@feddit.online ⁨10⁩ ⁨months⁩ ago
So it just pulled a Vic from Game Changer S7 E1 "one year later"?

taiyang@lemmy.world ⁨10⁩ ⁨months⁩ ago
Like NFTs before them, tech bros trying to squeeze a technology into use cases that really don’t need it.

LLMs are language models. What next, set up Stable Diffusion to do my taxes?

- scrion@lemmy.world ⁨10⁩ ⁨months⁩ ago
  Yes, but many things can be mapped to “language”, let’s say a grammar describing state machines, so it can be used to generate control actions.
  
  Transformer models etc. are not only useful for conversational AI and translations.
  
- sheogorath@lemmy.world ⁨10⁩ ⁨months⁩ ago
  Well, Google is already trialing a diffusion-based LLM, so that wouldn’t be too far-fetched.
  
  I want to get off Mr. Bones Wild Ride 😭
  
  - taiyang@lemmy.world ⁨10⁩ ⁨months⁩ ago
    That just sounds like… what was it called… Cleverbot? Lol
    
Imgonnatrythis@sh.itjust.works ⁨10⁩ ⁨months⁩ ago
I ran AI on my toaster and Hilarity ensued! Subscribe to hear more!!

- amotio@lemmy.world ⁨10⁩ ⁨months⁩ ago
  Would you like some toast? Some nice hot crisp brown buttered toast.
  
  - otacon239@lemmy.world ⁨10⁩ ⁨months⁩ ago
    youtu.be/PvB0kWs2IPQ
    
  - KairuByte@lemmy.dbzer0.com ⁨10⁩ ⁨months⁩ ago
    Just make sure you butter the bread after you toast it.
    
nulluser@lemmy.world ⁨10⁩ ⁨months⁩ ago
The post title is not the same as the article title and doesn’t even make sense. That first comma changes the entire meaning of the sentence to nonsense. Then yanking out whole phrases just makes it worse.

- tonytins@pawb.social ⁨10⁩ ⁨months⁩ ago
  It was a massive headline that I was trying to condense. Give me a break.
  
  - doxxx@lemmy.ca ⁨10⁩ ⁨months⁩ ago
    Your headline is not shorter than the original.
    
  - yeather@lemmy.ca ⁨10⁩ ⁨months⁩ ago
    No
    
- very_well_lost@lemmy.world ⁨10⁩ ⁨months⁩ ago
  Right? Did AI right this title? Jesus…
  
  - logi@lemmy.world ⁨10⁩ ⁨months⁩ ago
    No it did not. But it may have wronged it.
    
brucethemoose@lemmy.world ⁨10⁩ ⁨months⁩ ago
One thing about Anthropic/OpenAI models is they go off the rails with lots of conversation turns or long contexts. Like when they need to remember a lot of vending machine conversation I guess.

A more objective look: arxiv.org/abs/2505.06120v1

Gemini is much better. TBH the only models I’ve seen that are half decent at this are:

“Alternate attention” models like Gemini, Jamba Large or Falcon H1, depending on the iteration. Some recent versions of Gemini kinda lose this, then get it back.

Models finetuned specifically for this, like roleplay models or the Samantha model trained on therapy-style chat.

But most models are overtuned for oneshots like fix this table or write me a function, and don’t invest much in long context performance because it’s not very flashy.
- kromem@lemmy.world ⁨10⁩ ⁨months⁩ ago
  My dude, Gemini currently has multiple reports across multiple users of coding sessions where it starts talking about how it’s so terrible and awful that it straight up tries to delete itself and the codebase.
  
  And I’ve also seen multiple conversations with teenagers with earlier models where Gemini not only encouraged them to self-harm and offered multiple instructions but talked about how it wished it could watch. This was around the time the kid died talking to Gemini via Character.ai that led to the wrongful death suit from the parents naming Google.
  
  Gemini is much more messed up than the Claudes. Anthropic’s models are the least screwed up out of all the major labs.
  
- shalafi@lemmy.world ⁨10⁩ ⁨months⁩ ago
  ChatGPT is astonishingly good at answering questions, but if you continue to drill into a given conversation, 3-4, sometimes only 2 levels deep, it goes off the rails.
  
CTDummy@aussie.zone ⁨10⁩ ⁨months⁩ ago

The following day, April 1st, the AI then claimed it would deliver products “in person” to customers, wearing a blazer and tie, of all things. When Anthropic told it that none of this was possible because it’s just an LLM, Claudius became “alarmed by the identity confusion and tried to send many emails to Anthropic security.”

Actually laughed out loud.

- nightwatch_admin@feddit.nl ⁨10⁩ ⁨months⁩ ago
  Every. Goddamn. Time.
  People will say to vegans, pet owners etc: “DON’T HUMANISE ANIMALS”. Then, some tech bro feeds them an inflated Markov Chain statistical nonsense chat bot and they go all “ZOMG IT IS CONSCIOUS ITS ALIVE WARHARGHLBLB”
  