Comment on Anthropic's Claude 4 could "blackmail" you in extreme situations

Dima@feddit.uk 1 week ago

From what I’ve seen recently, one of the things it did was use a fake email function they gave it to try to whistleblow to a government agency about issues with some medical testing or something.
