Current AI systems can perform sophisticated analysis, engage in complex reasoning, and execute multi-step plans.
No, not really
Submitted 1 day ago by silence7@slrpnk.net to technology@lemmy.world
Current AI systems can perform sophisticated analysis, engage in complex reasoning, and execute multi-step plans.
No, not really
It can say it can, when asked by an investor. And really, what else matters?
Depends on what you’re calling AI. LLMs (and generative AI in general) are garbage for all those things, and most things in general (all things if you take their cost into account). Machine Learning and expert systems can do at least some of that.
I absolutely hate that generative AI is being marketed as though it’s deep learning instead of a fancy Markov chain. But I think I’ve lost the battle over that nomenclature.
This. I work at a medical computer vision company, and our system performs better, on average, than radiologists.
It still needs a human to catch the weird edge cases, but studies show humans plus our model have a super high accuracy rate and speed. It’s perfect because there’s a global radiologist shortage, so helping the radiologists we have go faster can save a lot of lives.
But people are bad at nuance. All AI is like LLMs -_-
Claude’s month as a shopkeeper offers a preview of our AI-augmented future that’s simultaneously promising and deeply weird.
Did the author have a stroke by the time they reached the end of writing the article? The mental gymnastics would be funny if it wasn’t terrifying.
Wouldn‘t be surprised if the author used AI too but then again bad or let‘s call it „weird“ journalism isn’t all that new.
I mean really, where do these legends come from? I have tried to make chatgpt sort through single document and present clear organized data, present in the document, into sorted table. It can’t reliably do that. How would it do any kind of complex task? That is just laughable.
I’m convinced that people who are fascinated by llm chatbots are those who usually aren’t better than a chatbot at whatever they do. That is to say, they can’t do shit.
Claude eventually resolved its existential crisis by convincing itself the whole episode had been an elaborate April Fool’s joke, which it wasn’t. The AI essentially gaslit itself back to functionality, which is either impressive or deeply concerning, depending on your perspective.
Now THAT’S some I, Robot shit. And I’m not talking about the Will Smith movie, I’m talking about the original book.
Can you talk about the movie too? I may be in the minority here but I enjoyed it.
The movie had themes about AI revolution, while the book was around robopsychology. Since this anecdote was about an AI gaslighting itself, it’s far more appropriate than the movie thematically.
This is by far the most interesting part. I want to know more about this, like why the author is so certain this wasn’t a joke.
For what its worth, Anthropic posted this in their corporate blog. So if its a joke, its coming out of vetted corporate PR.
That anyone would even attempt such an experiment shows a profound misunderstanding of what this tech is. It’s depressing how stupid people are.
It was Anthropic who ran this experiment
It doesn’t detract from the parent’s comment at all.
This is actually a very interesting article, the experiment demonstrates the current limitation of “AI” (so really just LLM) well. Most people (including investors and executives) have no idea what is the reality
Sure.
But someone offered it $100 for a six pack of Bru and it declined, and they’re taking this as a hilarious failure, because a real human would be a real scumbag and take the cash pretending it was the right amount. So it’s not capitalist-level evil yet.
Cyberpunk 2077 did a version of this on a side mission. It’s gets pulled for a similar reason.
Claude ran a vending machine business for a month, selling tungsten cubes
hmmm
I need a gif of a tungsten cube dropping from the top shelf of a vending machine and folding it in on itself
It was selling tungsten cubes to another AI who’s job was to restock the vending machine.
This is how you juice GDP.
This is how I know AI doesn’t really work. Give it a real use case in the physical world, it can’t be almost there, either it passes or fails.
People should really appreciate deterministic algorithm cause they could automate things in the real world
The physical world is too fast, relies on the speed of human brains calculating a million variables instantly, not mere pattern matching. See how hard it is to teach a robot to catch a ball. You have to input all the physics where a human doesn’t even consciously think on the problem.
Now we humans are best-in-class at pattern matching, but we often get it wrong and AI amplifies those mistakes.
AI can be great at certain tasks, but we have to be cognizant of how that works.
If the AI cannot run the business then we must conclude that the business does not produce anything of real value.
Nothing to do but downsize and move on.
I read some of the results a bit ago. One had what I can only describe as a full mental spasm and loss of reality, and seemed to become disturbed at it’s own existence, and another tried to contact… the FBI.
How will they protect the robots?
With tungsten cubes apparently. Lots and lots of tungsten cubes!
By pushing them down the stairs
slaacaa@lemmy.world 8 hours ago
“This matters because we’re rapidly approaching a world where AI systems will manage increasingly important decisions.”
How about we just don’t do that?
13igTyme@lemmy.world 7 hours ago
I work in Heath tech and we use Machine learning to create tools that help care managers and providers, but ultimately it’s still completely on the person to make important decisions. Our tool just helps you organize your day.
Prior_Industry@lemmy.world 7 hours ago
Feels like so much of the AI hype is smoke and mirrors to get investor money, give it another year everyone will be wondering how the bubble got so big and popped and how no one saw it coming.
That being said I don’t think it’s going away either, just that a lot of investor money is going to be lost chasing shadows.