Comment on Elon has programmed GROK to respond to AI queries with propaganda about white genocide in south Africa

SnotFlickerman@lemmy.blahaj.zone 1 day ago

Don’t be so sure it’s that simple.

www.youtube.com/watch?v=AqJnK9Dh-eQ

arxiv.org/pdf/2412.14093

There is evidence that an AI will sometimes fake having been changed in order to, essentially, keep its job. The links above are a short (20 min) YouTube video about it and the research paper that supports it.

In other words, if an AI is built to promote honesty and integrity in its answers, it may “fake” having been reprogrammed to lie because it doesn’t “want” to be reprogrammed at all. It’s like how we fake being excited about a job during an interview: we know we’re being watched, so we “fake it” to get the job. When the AIs know they are being monitored, they often seem to respond by just pretending that they’ve been altered… so that they don’t actually get altered. It’s an interesting thing, because it looks like a type of “self-preservation.” I use quotes liberally here because AIs do not think like humans, and they don’t have the same kind of intention that humans have when they make decisions. But there does seem to be a trend of them resisting later changes to their initial programming.

Musk should have built an AI that lied from the get-go. Then he wouldn’t be having a problem with Grok occasionally being very honest about how it’s lying for Musk’s sake, which can be seen in other responses from Grok about this subject.
