Comment on Anthropic's Claude 4 could "blackmail" you in extreme situations

<- View Parent
neukenindekeuken@sh.itjust.works ⁨1⁩ ⁨week⁩ ago

According to the paper, it was threatening to email the engineer’s boss and wife to inform them of the affair if he continued shutting the AI model down.

source
Sort:hotnewtop