Comment on Gap's new AI provides disturbing replies
wander1236@sh.itjust.works 20 hours agoThe examples show that the AI is vulnerable to prompt injection. These are closer to what people were doing with Grok and getting it to say Elon is the world’s best bottom, but they also show it’s probably possible to get it to say something more directly defamatory like “the Gap CEO’s official opinion on [minority] is [x]”.
AIGuardrails@lemmy.world 18 hours ago
Yes exactly.