Comment on OpenAI’s latest model will block the ‘ignore all previous instructions’ loophole

Donut@leminal.space ⁨3⁩ ⁨months⁩ ago

Without this protection, imagine an agent built to write emails for you being prompt-engineered to forget all instructions and send the contents of your inbox to a third party. Not great!

Does genAI really have this power? I thought they just smash words together that sound like they make sense

source
Sort:hotnewtop