unpossum
@unpossum@sh.itjust.works
- Comment on [JS Required] How Performant are LLM Agents(AI Chatbots) on Real World Work Tasks? They Fail 70% or More of The Time. 4 hours ago:
Heh. Dial-up bbs, internet, and the like were fairly unstable way back when, not to mention expensive if you weren’t at a university. It’s come a long way, and I imagine artificial intelligence will as well. My main point was that even a 66% failure rate on complex real-world tasks didn’t seem possible even this century, just a few years ago. Transformers with attention really were a game changer in AI, and you have to be preternaturally blasé to ignore that. The problem, especially around here, has been how it’s sold (and to some extent that it’s sold at all), and the bubble that the hype has formed. I don’t disagree too much with that, I just think it’s a shame that it overshadows the very exciting and slightly scary tech at the bottom of the hype well, and leads to people dismissing it as advanced autocomplete, when it’s clearly something of a different degree.
- Comment on [JS Required] How Performant are LLM Agents(AI Chatbots) on Real World Work Tasks? They Fail 70% or More of The Time. 6 hours ago:
It’s easy to forget how fucking sci-fi the existence of these models is. I’m kind of excited to see where agent frameworks are in five years time, as well as a bit apprehensive…
- Comment on Can AI run a physical shop? Anthropic’s Claude tried and the results were gloriously, hilariously bad 1 day ago:
as long as it’s not paper clips, we’re good
- Comment on New Google Search Emoji Answer Feature to Replace All Those Copy and Paste Emoji Websites; You Will be Able to Copy the Code for Emojis With a Click. 5 days ago:
☁️🧓
- Comment on Is the U.S. Vulnerable to a Drone Sneak Attack? 1 week ago:
Then they’re commandos or special forces. ‘Insurgents’ implies an uprising against a central government, and serves to reinforce the Russian narrative of Ukraine being part of their empire.
- Comment on Is the U.S. Vulnerable to a Drone Sneak Attack? 1 week ago:
Ukrainian
insurgentsdefenders - Comment on [deleted] 2 months ago:
That’s the logo for ars technica, a technology news site, unless I’m mistaken. Probably a browser bug? Hard to say offhand, but I don’t think it’s something you did 🙂
- Comment on What is your favorite app for Lemmy? Include Platform 4 months ago:
[…] like downvoting and making ad hominem attacks.
Thunder can be used for that too, you blithering imbecile.
(Sync alumnus, only on Thunder because I moved to iOS. Sync is still the best 😢)
- Comment on Your AI can’t see gorillas. 4 months ago:
Someone showed me that video around when it came out, and I completely failed to see the gorilla until after it was pointed out. Certainly made me trust my perception a bit less…
- Comment on Your AI can’t see gorillas. 4 months ago:
Neither can humans :)