Comments


mrductape@eviltoast.org ⁨5⁩ ⁨months⁩ ago
Well, it’s almost correct. It’s just one letter off. Maybe if we invest millions more it will be right next time.

Or maybe it is just not accurate and never will be… I will not ever fully trust AI. I’m sure there are use cases for it, I just don’t have any.

source
- TheFogan@programming.dev ⁨5⁩ ⁨months⁩ ago
  Cases where you want something googled quickly to get an answer, and it’s low consequence when the answer is wrong.
  
  E.g., a bar argument over whether that guy was in that movie. Or you need a customer service agent, but don’t actually care about your customers and don’t want to pay someone.
  
  source
  - elvith@feddit.org ⁨5⁩ ⁨months⁩ ago
    How it started:
    
    Or you need a customer service agent, but don’t actually care about your customers and don’t want to pay someone
    
    How it’s going:
    
    IKEA
    
    Chevy
    
    …
    
    source
  - atopi@piefed.blahaj.zone ⁨5⁩ ⁨months⁩ ago
    Isn’t checking if someone was in a movie really easy to do without AI?
    
    source
- kilgore_trout@feddit.it ⁨5⁩ ⁨months⁩ ago
  Just one more private nuclear power plant, bro…
  
  source
  - anomnom@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
    They’re using oil, gas, and if Trump gets his way, fucking coal.
    
    Unless you count Three Mile Island.
    
    source
Djehngo@lemmy.world ⁨5⁩ ⁨months⁩ ago
The letters that make up words are a common blind spot for AIs. Since they are trained on strings of tokens (roughly words), they don’t have a good concept of which letters are inside those words or what order they are in.

source
- PixelatedSaturn@lemmy.world ⁨5⁩ ⁨months⁩ ago
  I find it bizarre that people find these obvious cases to prove the tech is worthless. Like saying cars are worthless because they can’t go under water.
  
  source
  - skisnow@lemmy.ca ⁨5⁩ ⁨months⁩ ago
    Not bizarre at all.
    
    The point isn’t “they can’t do word games therefore they’re useless”, it’s “if this thing is so easily tripped up on the most trivial shit that a 6-year-old can figure out, don’t be going round claiming it has PhD level expertise”.
    
    source
  - knatschus@discuss.tchncs.de ⁨5⁩ ⁨months⁩ ago
    Then why is Google using it for questions like that?

    Surely it should be advanced enough to realise its weakness with this kind of question and just not give an answer.
    
    source
  - echodot@feddit.uk ⁨5⁩ ⁨months⁩ ago
    Well it also can’t code very well either
    
    source
  - figjam@midwest.social ⁨5⁩ ⁨months⁩ ago
    Understanding the bounds of tech makes it easier for people to gauge its utility. The only people who desire ignorance are those that profit from it.
    
    source
  - EnsignWashout@startrek.website ⁨5⁩ ⁨months⁩ ago
    
    I find it bizarre that people find these obvious cases to prove the tech is worthless. Like saying cars are worthless because they can’t go under water.
    
    This reaction is because conmen are claiming that current generations of LLM technology are going to remove our need for experts and scientists.

    We’re not demanding submersible cars, we’re just laughing about the people paying top dollar for the latest electric car while planning an ocean cruise.
    
    I’m confident that there’s going to be a great deal of broken… everything…built with AI “assistance” during the next decade.
    
    source
  - mrductape@eviltoast.org ⁨5⁩ ⁨months⁩ ago
    Well technically cars can go underwater. They just cannot get out because they stop working.
    
    source
- azertyfun@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
  It’s very funny that you can get ChatGPT to spell out the word (making each letter an individual token) and still be wrong.
  
  Of course it makes complete sense when you know how LLMs work, but this demo does a very concise job of short-circuiting the cognitive bias that talking machine == thinking machine.
  
  source
ilinamorato@lemmy.world ⁨5⁩ ⁨months⁩ ago
✅ Colorado

✅ Connedicut

✅ Delaware

❌ District of Columbia (on a technicality)

✅ Florida

But not

❌ I’aho

❌ Iniana

❌ Marylan

❌ Nevaa

❌ North Akota

❌ Rhoe Islan

❌ South Akota

source
- individual@toast.ooo ⁨5⁩ ⁨months⁩ ago
  Gosh tier comment.
  
  source
  - ilinamorato@lemmy.world ⁨5⁩ ⁨months⁩ ago
    You just described most of my post history.
    
    source
- boonhet@sopuli.xyz ⁨5⁩ ⁨months⁩ ago
  Everyone knows it’s properly spelled “I, the ho” not Idaho. That’s why it didn’t make the list.
  
  source
echodot@feddit.uk ⁨5⁩ ⁨months⁩ ago
You joke, but I bet you didn’t know that Connecticut contained a “d”

I wonder what other words contain letters we don’t know about.

source
- Rcklsabndn@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
  The famous ‘invisible D’ of Connecticut, my favorite SCP.
  
  source
  - kuberoot@discuss.tchncs.de ⁨5⁩ ⁨months⁩ ago
    That actually sounds like a fun SCP - a word that doesn’t seem to contain a letter, but when testing for the presence of that letter using an algorithm that exclusively checks for that presence, it reports the letter is indeed present. Any attempt to check where in the word the letter is, or to get a list of all letters in that word, spuriously fails. Containment could be fun, probably involving amnestics and widespread societal influence. I also wonder if they could create an algorithm for checking letter presence that can be performed by hand without leaking any other information to the person performing it, reproducing the anomaly without computers.
    
    source
  - iamdefinitelyoverthirteen@lemmy.world ⁨5⁩ ⁨months⁩ ago
    ct -> d is a not-uncommon OCR fuck up. Maybe that’s the source of its garbage data?
    
    source
  - ICastFist@programming.dev ⁨5⁩ ⁨months⁩ ago
    SCP-00WTFDoC (lovingly called “where’s the fucking D of Connecticut” by the foundation workers, also “what the fuck, doc?”)
    
    People think it’s safe, because it’s “just an invisible D”, not even a dick, just the letter D, and it only manifests verbally when someone tries to say “connecticut” or write it down. When you least expect it, everyone heard “Donnedtidut”, everyone read that thing and a portal to that fucking place opens and drags you in.
    
    source
  - ripcord@lemmy.world ⁨5⁩ ⁨months⁩ ago
    Words are full of mystery! Besides the invisible D, Connecticut has that inaudible C…
    
    source
- villainy@lemmy.world ⁨5⁩ ⁨months⁩ ago
  Every American I know does pronounce it like Connedicut 🤔
  
  source
  - Corkyskog@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
    Really? Everyone I know calls it kinetic-cut. But I grew up in New England.
    
    source
- jaupsinluggies@feddit.uk ⁨5⁩ ⁨months⁩ ago
  The d in Connecticut is between the e and the i. They don’t connect because it was cut.
  
  source
  - Uruanna@lemmy.world ⁨5⁩ ⁨months⁩ ago
    Connecticut is Jewish?
    
    source
- OrteilGenou@lemmy.world ⁨5⁩ ⁨months⁩ ago
  Connedicut
  
  source
  - Aneb@lemmy.world ⁨5⁩ ⁨months⁩ ago
    I was going to make a joke that if you’re from Connedicut you never pronounce the first d in the word. Conne-icut
    
    source
Multiplexer@discuss.tchncs.de ⁨5⁩ ⁨months⁩ ago
No, this is Google throwing the cheapest possible shit at you that is barely capable of stringing together 5 coherent sentences but has the reasoning capability of a tapeworm.

Here is the output of the minimalist open Chinese model Qwen3, which runs locally on my 6-year-old mid-range PC:

The US states that contain the letter "d" (case-insensitive, as state names are typically capitalized) are as follows. I've verified each state name for the presence of the letter "d" in its standard spelling:

Colorado (contains "d" in "Colorado")
Delaware (starts with "D")
Florida (contains "d" in "Florida")
Idaho (contains "d" in "Idaho")
Indiana (contains "d" in "Indiana")
Maryland (ends with "d" in "Maryland")
Nevada (contains "d" in "Nevada")
North Dakota (contains "d" in "Dakota")
Rhode Island (contains "d" in "Rhode")
South Dakota (contains "d" in "Dakota")

Total: 10 states.
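
For comparison, the same question is a plain substring test in ordinary code. A minimal sketch in Python, where the `STATES` list and the `states_with_letter` helper are names made up here for illustration and are not part of the model output above:

```python
# All 50 US state names, typed out by hand for this sketch.
STATES = [
    "Alabama", "Alaska", "Arizona", "Arkansas", "California", "Colorado",
    "Connecticut", "Delaware", "Florida", "Georgia", "Hawaii", "Idaho",
    "Illinois", "Indiana", "Iowa", "Kansas", "Kentucky", "Louisiana",
    "Maine", "Maryland", "Massachusetts", "Michigan", "Minnesota",
    "Mississippi", "Missouri", "Montana", "Nebraska", "Nevada",
    "New Hampshire", "New Jersey", "New Mexico", "New York",
    "North Carolina", "North Dakota", "Ohio", "Oklahoma", "Oregon",
    "Pennsylvania", "Rhode Island", "South Carolina", "South Dakota",
    "Tennessee", "Texas", "Utah", "Vermont", "Virginia", "Washington",
    "West Virginia", "Wisconsin", "Wyoming",
]


def states_with_letter(letter: str) -> list[str]:
    """Return every state whose name contains `letter`, ignoring case."""
    return [name for name in STATES if letter.lower() in name.lower()]


matches = states_with_letter("d")
print(matches)
print(len(matches))  # 10
```

Code operates on characters, so it cannot "see" a d in Connecticut. The model operates on tokens, which is why the two can disagree.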
source
- Rcklsabndn@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
  Illinois contains a hidden D which is in your mom.
  
  source
  - Multiplexer@discuss.tchncs.de ⁨5⁩ ⁨months⁩ ago
    I didn’t understand your comment, so I asked the same LLM as before.
    It explained it and I think that I get it now. Low-grade middle-school-“Your Mom”-joke, is it? Ha-ha… 🙄
    
    This also means that AI did better than myself at both tasks I’ve given it today (I found only 9 states with “d” when going over the state-list myself…).
    
    Whatever. I’m gonna have second lunch now.
    
    source
- FauxLiving@lemmy.world ⁨5⁩ ⁨months⁩ ago
  Exactly.
  
  The model that responds to your search query is designed to be cheap, not accurate. It has to generate an answer to every single search issued to Google. They’re not using high parameter models with reasoning because those would be ruinously expensive.
  
  source
Jankatarch@lemmy.world ⁨5⁩ ⁨months⁩ ago
They took money from cancer research programs to fund this.

source
- Burninator05@lemmy.world ⁨5⁩ ⁨months⁩ ago
  After we pump another hundred trillion dollars and half the electricity generated globally into AI you’re going to feel pretty foolish for this comment.
  
  source
  - veni_vedi_veni@lemmy.world ⁨5⁩ ⁨months⁩ ago
    Just a couple billion more parameters, bro, I swear, it will replace all the workers
    
    CEOs
    
    source
- jumping_redditor@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
  only cancer patients benefit from cancer research, CEOs benefit from AI
  
  source
  - Jankatarch@lemmy.world ⁨5⁩ ⁨months⁩ ago
    Tbf cancer patients benefit from AI too tho a completely different type that’s not really related to LLM chatbot AI girlfriend technology used in these.
    
    source
- kreskin@lemmy.world ⁨5⁩ ⁨months⁩ ago
  Well as long as we still have enough money to buy weapons for that one particular filthy country in the middle east, we’re fine.
  
  source
dude@lemmings.world ⁨5⁩ ⁨months⁩ ago
Well, for anyone who knows a bit about how LLMs work, it’s pretty obvious why LLMs struggle with identifying the letters in the words

source
- BritishJ@lemmy.world ⁨5⁩ ⁨months⁩ ago
  Well go on…
  
  source
  - JustTesting@lemmy.hogru.ch ⁨5⁩ ⁨months⁩ ago
    They don’t look at it letter by letter but in tokens, which are automatically generated separately based on occurrence. So while ‘z’ could be its own token, ‘ne’ or even ‘the’ could be treated as a single token vector. Of course, ‘e’ would still be a separate token when it occurs in isolation. You could even have ‘le’ and ‘let’ as separate tokens, afaik. And each token is just a vector of numbers, like 300 or 1000 numbers that represent that token in a vector space. So ‘de’ and ‘e’ could be completely different and dissimilar vectors.

    So ‘delaware’ could look to an LLM more like de-la-w-are or similar.

    Of course you could train it to figure out letter counts based on those tokens with a lot of training data, though that could lower performance on other tasks, and counting letters just isn’t that important, I guess, compared to other stuff.
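
    A minimal sketch of that splitting, assuming a made-up vocabulary and a greedy longest-match rule. Real tokenizers learn their vocabulary from data and will split differently; `TOY_VOCAB` and `toy_tokenize` are hypothetical names for illustration only:

```python
# Toy illustration only: the vocabulary and IDs below are invented,
# not taken from any real model's tokenizer.
TOY_VOCAB = {"de": 0, "la": 1, "w": 2, "are": 3, "d": 4, "e": 5, "l": 6, "a": 7, "r": 8}


def toy_tokenize(word: str) -> list[str]:
    """Greedy longest-match split of `word` into pieces from TOY_VOCAB."""
    pieces = []
    i = 0
    while i < len(word):
        # Try the longest candidate first, shrinking until one is in the vocabulary.
        for j in range(len(word), i, -1):
            if word[i:j] in TOY_VOCAB:
                pieces.append(word[i:j])
                i = j
                break
        else:
            raise ValueError(f"no token covers {word[i]!r}")
    return pieces


pieces = toy_tokenize("delaware")
ids = [TOY_VOCAB[p] for p in pieces]
print(pieces)  # ['de', 'la', 'w', 'are']
print(ids)     # [0, 1, 2, 3]
# The model only receives the IDs. The ID for a standalone "d" (4) never
# shows up, even though the letter "d" is in the word.
print(TOY_VOCAB["d"] in ids)  # False
```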
    
    source
  - Gladaed@feddit.org ⁨5⁩ ⁨months⁩ ago
    Which State contains 狄? They use a different alphabet, so understanding ours is ridiculous.
    
    source
sqgl@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
ChatGPT is just as stupid.

Image

source
- SaveTheTuaHawk@lemmy.ca ⁨5⁩ ⁨months⁩ ago
  it’s actually getting dumber.
  
  source
Blackmist@feddit.uk ⁨5⁩ ⁨months⁩ ago
Just another trillion, bro.

source
- NateNate60@lemmy.world ⁨5⁩ ⁨months⁩ ago
  Just another 1.21 jigawatts of electricity, bro. If we get this new coal plant up and running, it’ll be enough.
  
  source
- Tryenjer@lemmy.world ⁨5⁩ ⁨months⁩ ago
  Behold the most expensive money burner!
  
  source
panda_abyss@lemmy.ca ⁨5⁩ ⁨months⁩ ago
Yesterday I asked Claude Sonnet what was on my calendar (since they just announced that feature)

It listed my work meetings on Sunday, so I tried to correct it…

You’re absolutely right - I made an error! September 15th is a Sunday, not a weekend day as I implied. Let me correct that: This Week’s Remaining Schedule: Sunday, September 15

Just today when I asked what’s on my calendar it gave me today and my meetings on the next two thursdays. Not the meetings in between, just thursdays.

Something is off in AI land.

source
- FlashMobOfOne@lemmy.world ⁨5⁩ ⁨months⁩ ago
  A few weeks ago my Pixel wished me a Happy Birthday when I woke up, and it definitely was not my birthday. Google is definitely letting a shitty LLM write code for it now, but the important thing is they’re bypassing human validation.
  
  Stupid. Just stupid.
  
  source
  - python@lemmy.world ⁨5⁩ ⁨months⁩ ago
    pixel? ~~have you heard about grapheneOS tho…~~
    
    source
- achance4cheese@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
  Also, Sunday September 15th is a Monday… I’ve seen so many meeting invites with dates and days that don’t match lately…
  
  source
  - panda_abyss@lemmy.ca ⁨5⁩ ⁨months⁩ ago
    Yeah, it said Sunday, I asked if it was sure, then it said I’m right and went back to Sunday.
    
    I assume the training data has the model think it’s a different year or something, but this feature is straight up not working at all for me. I don’t know if they actually tested this at all.
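
    The "different year" guess is easy to sanity-check. A quick sketch, assuming the conversation took place in 2025 and the model was reasoning as if it were 2024. Both years are assumptions, nothing in this thread states them, and `weekday_name` is a helper made up for the example:

```python
from datetime import date

DAY_NAMES = ("Monday", "Tuesday", "Wednesday", "Thursday", "Friday", "Saturday", "Sunday")


def weekday_name(year: int, month: int, day: int) -> str:
    """Return the English weekday name for a calendar date."""
    # date.weekday() returns 0 for Monday through 6 for Sunday.
    return DAY_NAMES[date(year, month, day).weekday()]


# Assumed years for illustration; the thread never says which year it is.
for year in (2024, 2025):
    print(year, weekday_name(year, 9, 15))
# 2024 Sunday
# 2025 Monday
```

    If those assumed years are right, "September 15 is a Sunday" is exactly what a model stuck one year in the past would say.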
    
    Sonnet seems to have gotten stupider somehow.
    
    Opus isn’t following instructions lately either.
    
    source
- MangoCats@feddit.it ⁨5⁩ ⁨months⁩ ago
  We’ve used the Google AI speakers in the house for years, they make all kinds of hilarious mistakes. They also are pretty convenient and reliable for setting and executing alarms like “7AM weekdays”, and home automation commands like “all lights off”. But otherwise, it’s hit and miss and very frustrating when they push an update that breaks things that used to work.
  
  source
kiku@feddit.org ⁨5⁩ ⁨months⁩ ago
Also verified

Image

source
- threeonefour@piefed.ca ⁨5⁩ ⁨months⁩ ago
  Wait a sec, Minnasoda doesn't have a d??
  
  source
  - dumbass@leminal.space ⁨5⁩ ⁨months⁩ ago
    That’s how everyone from America seems to say it, besides Jesse Ventura who heavily emphasises the t.
    
    source
  - SpaceNoodle@lemmy.world ⁨5⁩ ⁨months⁩ ago
    *mini soda
    
    source
  - lugal@lemmy.dbzer0.com ⁨5⁩ ⁨months⁩ ago
    Neither does soda
    
    source
- sugar_in_your_tea@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
  Where’s Nevada? And Montana?
  
  source
  - hobovision@mander.xyz ⁨5⁩ ⁨months⁩ ago
    I just love the d in Montana. Shame it missed it.
    
    source
ArsonButCute@lemmy.dbzer0.com ⁨5⁩ ⁨months⁩ ago
Hey look the markov chain showed its biggest weakness (the markov chain)!

In the training data, it could be assumed by output that Connecticut usually follows Colorado in lists of two or more states containing Colorado. There is no other reason for this to occur as far as I know.

Markov Chain based LLMs (I think that’s all of them?) are dice-roll systems constrained to probability maps.
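
A toy sketch of that "probability map" idea, assuming a hand-made corpus of three alphabetical state lists and a first-order table that only looks at the previous item. Real LLMs condition on far more context than one previous item, so this only illustrates the intuition; `CORPUS` and `next_item_probs` are hypothetical names:

```python
from collections import Counter, defaultdict

# Invented mini-corpus: three lists of states in alphabetical order.
CORPUS = [
    ["California", "Colorado", "Connecticut", "Delaware"],
    ["Colorado", "Connecticut", "Delaware", "Florida"],
    ["Colorado", "Delaware", "Florida"],
]


def next_item_probs(corpus):
    """Estimate P(next item | previous item) from adjacent pairs in the corpus."""
    counts = defaultdict(Counter)
    for seq in corpus:
        for prev, nxt in zip(seq, seq[1:]):
            counts[prev][nxt] += 1
    return {
        prev: {nxt: n / sum(ctr.values()) for nxt, n in ctr.items()}
        for prev, ctr in counts.items()
    }


probs = next_item_probs(CORPUS)
print(probs["Colorado"])
# "Connecticut" follows "Colorado" in 2 of the 3 lists, so it is the most
# likely continuation regardless of which letters it contains.
best = max(probs["Colorado"], key=probs["Colorado"].get)
print(best)  # Connecticut
```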

source
- AlecSadler@lemmy.blahaj.zone ⁨5⁩ ⁨months⁩ ago
  Oh, I was thinking it’s because people pronounce it Connedicut
  
  source
  - ArsonButCute@lemmy.dbzer0.com ⁨5⁩ ⁨months⁩ ago
    Awe cute!
    
    source
- ramjambamalam@lemmy.ca ⁨5⁩ ⁨months⁩ ago
  I was wondering if you’d get similar results for states with the letter R, since there’s lots of prior art mentioning these states as either “D” or “R” during elections.
  
  source
skisnow@lemmy.ca ⁨5⁩ ⁨months⁩ ago
I don’t think this gets nearly enough visibility: www.academ-ai.info

Papers in peer-reviewed journals with (extremely strong) evidence of AI shenanigans.

source
- beveradb@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
  Thanks for sharing! I clicked on it with cynicism around how easily we could detect AI usage with confidence vs. risking making false allegations, but every single example on their homepage is super clear and I have no doubts - I’m impressed! (and disappointed)
  
  source
  - skisnow@lemmy.ca ⁨5⁩ ⁨months⁩ ago
    Yup. I had exactly the same trepidation, and then it was all like “As an AI model, I don’t have access to the data you requested, however here are some examples of…”
    
    source
gilokee@lemmy.world ⁨5⁩ ⁨months⁩ ago
Image

mine’s even worse somehow

source
- kilgore_trout@feddit.it ⁨5⁩ ⁨months⁩ ago
  You gave a slightly different prompt.
  
  source
  - sepi@piefed.social ⁨5⁩ ⁨months⁩ ago
    the thing still gave a stupid answer
    
    source
dumbass@leminal.space ⁨5⁩ ⁨months⁩ ago
Gemini is just a depressed and suicidal AI, be nice to it.

I had it completely melt down one day while messing around with its coding shit, I had to console it and tell it it’s doing good, we will solve this, was fucking weird as fuck.

source
- Vanilla_PuddinFudge@infosec.pub ⁨5⁩ ⁨months⁩ ago
  It’ll go in endless circles until it finds out why it’s wrong,
  
  then it will go right back to them anyway! lol
  
  source
Arghblarg@lemmy.ca ⁨5⁩ ⁨months⁩ ago
“AI” hallucinations are not a problem that can be fixed in LLMs. They are an inherent aspect of the process and an inevitable result of the fact that LLMs are mostly probabilistic engines, with no supervisory or introspective capability, which actual sentient beings possess and use to fact-check their output. So there. :p

source
- sexybenfranklin@ttrpg.network ⁨5⁩ ⁨months⁩ ago
  It’s funny seeing the list and knowing connecticut is only there because it’s alphabetically after colorado (in fact all four listed appear in that order alphabetically) because they probably scraped so many lists of states that the alphabetical order is the statistically most probable response in their corpus when any state name is listed.
  
  source
- Zwuzelmaus@feddit.org ⁨5⁩ ⁨months⁩ ago
  
  inevitable result of the fact that LLMs are mostly probabilistic engines
  
  So we should better put the question like
  
  “What is the probability of a D suddenly appearing in Connecticut?”
  
  source
FreedomAdvocate@lemmy.net.au ⁨5⁩ ⁨months⁩ ago
Gemini is trained on reddit data, what do you expect?

source
BlueMagma@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
I get the sentiment behind this post, and it’s almost always funny when LLMs are such dumbasses. But this is not a good argument against the technology. It is akin to climate change deniers using the argument: “look! It snowed today, climate change is so dumb huh?”

source
Aceticon@lemmy.dbzer0.com ⁨5⁩ ⁨months⁩ ago
“This is the technology worth trillions of dollars”

You can make anything fly high in the sky with enough helium, just not for long. (Welcome to the present day Tech Stock Market)

source
samus12345@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
Connedicut

source
SaveTheTuaHawk@lemmy.ca ⁨5⁩ ⁨months⁩ ago
We’re turfing out students by the tens on academic misconduct. They are handing in papers with references that clearly state “generated by Chat GPT”. Lazy idiots.

source
Jaysyn@lemmy.world ⁨5⁩ ⁨months⁩ ago
Blows my mind people pay money for wrong answers.

source
resipsaloquitur@lemmy.world ⁨5⁩ ⁨months⁩ ago
Listen, we just have to boil the ocean five more times.

Then it will hallucinate slightly less.

Or more. There’s no way to be sure since it’s probabilistic.

source
Jordan117@lemmy.world ⁨5⁩ ⁨months⁩ ago
One of these days AI skeptics will grasp that spelling-based mistakes are an artifact of text tokenization, not some wild stupidity in the model. But today is not that day.

source
chaosCruiser@futurology.today ⁨5⁩ ⁨months⁩ ago
In Copilot terminology, this is a “quick response” instead of the “think deeper” option. The latter actually stops to verify the initial answer before spitting it out.

source
Deestan@lemmy.world ⁨5⁩ ⁨months⁩ ago
Hey hey hey hey don’t look at what it actually does.

Look at what it feels like it almost can do and pretend it soon will!

source
ApeNo1@lemmy.world ⁨5⁩ ⁨months⁩ ago
“What did you learn at school today champ?”

“D is for cookie, that’s good enough for me
Oh, cookie, cookie, cookie starts with D”

AI Education for American Youth

source
Yaztromo@lemmy.world ⁨5⁩ ⁨months⁩ ago
GitLab Enterprise somewhat recently added support for Amazon Q (based on Claude) through an interface they call “GitLab Duo”. I needed to look up something in the GitLab docs, but thought I’d ask Duo/Q instead (the UI has this big button in the top left of every screen to bring up Duo to chat with Q):

(Paraphrasing…)

ME: How do I do X with Amazon Q in GitLab?

Q: Open the Amazon Q menu in the GitLab UI and select the appropriate option.

ME: [:looks for the non-existent menu:]

ME: Where in the UI do I find this menu?

Q: My last response was incorrect. There is no Amazon Q button in GitLab. In fact, there is no integration between GitLab and Amazon Q at all.

ME: [:facepalm:]

source
Mrkawfee@lemmy.world ⁨5⁩ ⁨months⁩ ago
You don’t get it because you aren’t a genius. This chatbot has clearly turned sentient and is trolling you.

source
Kolanaki@pawb.social ⁨5⁩ ⁨months⁩ ago
Connecdicud.

source
