Comment on ‘Happy (and safe) shooting!’ AI chatbots helped teen users plan violence in hundreds of tests

UnspecificGravity@piefed.social 4 days ago

Exactly. They won’t actually change the models, because they don’t understand the relationship between input and output well enough to target specific responses like this. What they will do instead is add an administrative filter layer on top, but it will always be something you can work around, because that is the nature of that kind of filter. The whole engine is still accessible.
