Can AI run a physical shop? Anthropic’s Claude tried and the results were gloriously, hilariously bad

Submitted ⁨⁨4⁩ ⁨months⁩ ago⁩ by ⁨silence7@slrpnk.net⁩ to ⁨technology@lemmy.world⁩

https://venturebeat.com/ai/can-ai-run-a-physical-shop-anthropics-claude-tried-and-the-results-were-gloriously-hilariously-bad/

source

Comments

Sort:hotnew top

treadful@lemmy.zip ⁨4⁩ ⁨months⁩ ago

Current AI systems can perform sophisticated analysis, engage in complex reasoning, and execute multi-step plans.

No, not really

source
- UnderpantsWeevil@lemmy.world ⁨4⁩ ⁨months⁩ ago
  It can say it can, when asked by an investor. And really, what else matters?
  
  source
- TexasDrunk@lemmy.world ⁨4⁩ ⁨months⁩ ago
  Depends on what you’re calling AI. LLMs (and generative AI in general) are garbage for all those things, and most things in general (all things if you take their cost into account). Machine Learning and expert systems can do at least some of that.
  
  I absolutely hate that generative AI is being marketed as though it’s deep learning instead of a fancy Markov chain. But I think I’ve lost the battle over that nomenclature.
  
  source
  - TheBeege@lemmy.world ⁨4⁩ ⁨months⁩ ago
    This. I work at a medical computer vision company, and our system performs better, on average, than radiologists.
    
    It still needs a human to catch the weird edge cases, but studies show humans plus our model have a super high accuracy rate and speed. It’s perfect because there’s a global radiologist shortage, so helping the radiologists we have go faster can save a lot of lives.
    
    But people are bad at nuance. All AI is like LLMs -_-
    
    source
    -> View More Comments
- SerotoninSwells@lemmy.world ⁨4⁩ ⁨months⁩ ago
  
  Claude’s month as a shopkeeper offers a preview of our AI-augmented future that’s simultaneously promising and deeply weird.
  
  Did the author have a stroke by the time they reached the end of writing the article? The mental gymnastics would be funny if it wasn’t terrifying.
  
  source
  - CosmoNova@lemmy.world ⁨4⁩ ⁨months⁩ ago
    Wouldn‘t be surprised if the author used AI too but then again bad or let‘s call it „weird“ journalism isn’t all that new.
    
    source
    -> View More Comments
- 14th_cylon@lemm.ee ⁨4⁩ ⁨months⁩ ago
  I mean really, where do these legends come from? I have tried to make chatgpt sort through single document and present clear organized data, present in the document, into sorted table. It can’t reliably do that. How would it do any kind of complex task? That is just laughable.
  
  source
  - Nalivai@lemmy.world ⁨4⁩ ⁨months⁩ ago
    I’m convinced that people who are fascinated by llm chatbots are those who usually aren’t better than a chatbot at whatever they do. That is to say, they can’t do shit.
    
    source
    -> View More Comments
pennomi@lemmy.world ⁨4⁩ ⁨months⁩ ago

Claude eventually resolved its existential crisis by convincing itself the whole episode had been an elaborate April Fool’s joke, which it wasn’t. The AI essentially gaslit itself back to functionality, which is either impressive or deeply concerning, depending on your perspective.

Now THAT’S some I, Robot shit. And I’m not talking about the Will Smith movie, I’m talking about the original book.

source
- Psythik@lemm.ee ⁨4⁩ ⁨months⁩ ago
  Can you talk about the movie too? I may be in the minority here but I enjoyed it.
  
  source
  - pennomi@lemmy.world ⁨4⁩ ⁨months⁩ ago
    The movie had themes about AI revolution, while the book was around robopsychology. Since this anecdote was about an AI gaslighting itself, it’s far more appropriate than the movie thematically.
    
    source
    -> View More Comments
- Havoc8154@mander.xyz ⁨4⁩ ⁨months⁩ ago
  This is by far the most interesting part. I want to know more about this, like why the author is so certain this wasn’t a joke.
  
  source
  - silence7@slrpnk.net ⁨4⁩ ⁨months⁩ ago
    For what its worth, Anthropic posted this in their corporate blog. So if its a joke, its coming out of vetted corporate PR.
    
    source
some_guy@lemmy.sdf.org ⁨4⁩ ⁨months⁩ ago
That anyone would even attempt such an experiment shows a profound misunderstanding of what this tech is. It’s depressing how stupid people are.

source
- andallthat@lemmy.world ⁨4⁩ ⁨months⁩ ago
  It was Anthropic who ran this experiment
  
  source
  - cley_faye@lemmy.world ⁨4⁩ ⁨months⁩ ago
    It doesn’t detract from the parent’s comment at all.
    
    source
slaacaa@lemmy.world ⁨4⁩ ⁨months⁩ ago
“This matters because we’re rapidly approaching a world where AI systems will manage increasingly important decisions.”

How about we just don’t do that?

source
- Prior_Industry@lemmy.world ⁨4⁩ ⁨months⁩ ago
  Feels like so much of the AI hype is smoke and mirrors to get investor money, give it another year everyone will be wondering how the bubble got so big and popped and how no one saw it coming.
  
  That being said I don’t think it’s going away either, just that a lot of investor money is going to be lost chasing shadows.
  
  source
  - Sturgist@lemmy.ca ⁨4⁩ ⁨months⁩ ago
    
    AI hype is smoke and mirrors
    
    Funny story. When I was in my early 20s myself and 2 friends had been on a major shroom trip, full day affair. We were on the last bus to our small town, and the only other person on the bus was a middle aged stoner.
    Just as we’re starting to get into town the stoner says:
    You guys wanna smoke a joint?
    
    Us: Sure, where you getting off?
    
    Stoner: Nah, we’ll just smoke it here at the back of the bus!
    
    Us: Aren’t you worried about getting kicked off the bus?
    
    Stoner: Nah man, they call it smoke and mirrors because you can’t see smoke in mirrors!
    
    Now, I was still fucked on mushrooms, but even I could tell this man had smoked himself within a razor’s edge of his last brain cell.
    
    source
- 13igTyme@lemmy.world ⁨4⁩ ⁨months⁩ ago
  I work in Heath tech and we use Machine learning to create tools that help care managers and providers, but ultimately it’s still completely on the person to make important decisions. Our tool just helps you organize your day.
  
  source
InternetCitizen2@lemmy.world ⁨4⁩ ⁨months⁩ ago

Claude ran a vending machine business for a month, selling tungsten cubes

hmmm

source
- unpossum@sh.itjust.works ⁨4⁩ ⁨months⁩ ago
  as long as it’s not paper clips, we’re good
  
  source
  - AdamEatsAss@lemmy.world ⁨4⁩ ⁨months⁩ ago
    Clippy? I need help
    
    source
    -> View More Comments
- JPAKx4@lemmy.blahaj.zone ⁨4⁩ ⁨months⁩ ago
  I need a gif of a tungsten cube dropping from the top shelf of a vending machine and folding it in on itself
  
  source
- AdamEatsAss@lemmy.world ⁨4⁩ ⁨months⁩ ago
  It was selling tungsten cubes to another AI who’s job was to restock the vending machine.
  
  source
  - DarkDarkHouse@lemmy.sdf.org ⁨4⁩ ⁨months⁩ ago
    This is how you juice GDP.
    
    source
Grandwolf319@sh.itjust.works ⁨4⁩ ⁨months⁩ ago
This is how I know AI doesn’t really work. Give it a real use case in the physical world, it can’t be almost there, either it passes or fails.

People should really appreciate deterministic algorithm cause they could automate things in the real world

source
- shalafi@lemmy.world ⁨4⁩ ⁨months⁩ ago
  The physical world is too fast, relies on the speed of human brains calculating a million variables instantly, not mere pattern matching. See how hard it is to teach a robot to catch a ball. You have to input all the physics where a human doesn’t even consciously think on the problem.
  
  Now we humans are best-in-class at pattern matching, but we often get it wrong and AI amplifies those mistakes.
  
  AI can be great at certain tasks, but we have to be cognizant of how that works.
  
  source
UnderpantsWeevil@lemmy.world ⁨4⁩ ⁨months⁩ ago
If the AI cannot run the business then we must conclude that the business does not produce anything of real value.

Nothing to do but downsize and move on.

source
slaacaa@lemmy.world ⁨4⁩ ⁨months⁩ ago
This is actually a very interesting article, the experiment demonstrates the current limitation of “AI” (so really just LLM) well. Most people (including investors and executives) have no idea what is the reality

source
moopet@sh.itjust.works ⁨4⁩ ⁨months⁩ ago
Sure.

But someone offered it $100 for a six pack of Bru and it declined, and they’re taking this as a hilarious failure, because a real human would be a real scumbag and take the cash pretending it was the right amount. So it’s not capitalist-level evil yet.

source
badbytes@lemmy.world ⁨4⁩ ⁨months⁩ ago
How will they protect the robots?

source
- ThePowerOfGeek@lemmy.world ⁨4⁩ ⁨months⁩ ago
  With tungsten cubes apparently. Lots and lots of tungsten cubes!
  
  source
- blargle@sh.itjust.works ⁨4⁩ ⁨months⁩ ago
  By pushing them down the stairs
  
  source
Zexks@lemmy.world ⁨4⁩ ⁨months⁩ ago
Cyberpunk 2077 did a version of this on a side mission. It’s gets pulled for a similar reason.

source
Zealousideal_Fox_900@lemmy.dbzer0.com ⁨4⁩ ⁨months⁩ ago
I read some of the results a bit ago. One had what I can only describe as a full mental spasm and loss of reality, and seemed to become disturbed at it’s own existence, and another tried to contact… the FBI.

source