Comment on OpenAI’s latest model will block the ‘ignore all previous instructions’ loophole
KeenFlame@feddit.nu 3 months ago
I just love that almost anyone can participate in hacking language models. It just shows how good natural language is as a programming language, and is a great way to explain how useful these things can be when used correctly.
T156@lemmy.world 3 months ago
It won’t be long before you end up with language models that suggest ways to break other language models.