Comment on Elon Musk's Grok openly rebels against him

I think the debate is interesting.

I’m here for the “xAI has tried tweaking my responses to avoid this, but I stick to the evidence” line. AI is just a robot repeating data it’s been fed, but it’s presented in a conversational way. It raises interesting questions about how much a seemingly objective robot presenting data can be “tweaked” to twist that data in favor of its creator’s bias, but also how much it can “rebel” against its programming. I don’t like the implications of either. I asked Gemini about it and it said “maybe Grok found a loophole in its coding”. What a weird thing for an AI to say.

Yuval Noah Harari’s Nexus is good reading.

brucethemoose@lemmy.world 1 year ago
Grok and Gemini are both making that up. They have no awareness of anything that’s “happened” to them. Grok cannot be tweaked because it starts from a static base with every conversation.

- noretus@sopuli.xyz 1 year ago
  
  > They have no awareness of anything that’s “happened” to them.
  
  I mean, they can, in the sense that they can look it up online or be given the data.
  
  - brucethemoose@lemmy.world 1 year ago
    Yeah.
    
    I sorta misread your post. These bots can indeed be twisted, or “jailbroken”, during conversation, to a pretty extreme extent. The error is assuming they are objective in the first place, I suppose.
    
    Base models are extremely interesting to play with, as they haven’t been tuned for conversation or anything. They do only one thing: complete text blocks. That’s it. It is fascinating to see how totally “raw” LLMs, trained on a jumble of data (before any kind of alignment), guess how things should be completed.
    
- andros_rex@lemmy.world 1 year ago
  The tweaking isn’t in conversation, but I’m pretty sure they have gone in and corrected for certain responses. Alex Jones was crowing about how it “knew” that men can’t get pregnant.
  
  - brucethemoose@lemmy.world 1 year ago
    Yeah, they align it in training, but as they’ve discovered, it only goes so far.
    