Comment on OpenAI’s latest model will block the ‘ignore all previous instructions’ loophole

Kazumara@discuss.tchncs.de 3 months ago

Not by itself, but if you wanted to put an LLM into a personal assistant, you could teach it specific codewords and have agent software that integrates with the email client scan its outputs for those codewords, triggering actions when they appear instead of passing them through to the textbox. Conceivably that could be useful if you wanted to give an LLM the power to react appropriately to “Open a new email to Kate and, in a formal tone, accept her invitation to the party she mentioned in her message yesterday.”
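For illustration, here is a minimal sketch of what that agent layer might look like. The `[[OPEN_EMAIL|...]]` codeword format, `open_email_draft()`, and the address are all made up for the example; nothing here is a real email-client API.

```python
import re

# Minimal sketch: the agent layer scans the LLM's output for an agreed-upon
# codeword and triggers an action instead of showing the codeword to the user.
# The [[OPEN_EMAIL|...]] format and open_email_draft() are hypothetical.
CODEWORD = re.compile(
    r"\[\[OPEN_EMAIL\|(?P<to>[^|]+)\|(?P<subject>[^|]+)\|(?P<body>[^\]]+)\]\]"
)

def open_email_draft(to: str, subject: str, body: str) -> None:
    # Stand-in for whatever the email-client integration would actually do.
    print(f"Drafting email to {to!r}, subject {subject!r}:\n{body}")

def handle_llm_output(text: str) -> str:
    """Intercept codewords, trigger the action, and return only the plain text."""
    match = CODEWORD.search(text)
    if match:
        open_email_draft(match.group("to"), match.group("subject"), match.group("body"))
        text = CODEWORD.sub("", text).strip()  # never show the codeword in the textbox
    return text

# The LLM has been taught to emit the codeword when asked to accept Kate's invitation.
reply = (
    "Sure, I have prepared the reply. "
    "[[OPEN_EMAIL|kate@example.com|Re: Party invitation|"
    "Dear Kate, thank you for the invitation. I would be delighted to attend. Kind regards]]"
)
print(handle_llm_output(reply))  # drafts the email, then shows only the plain reply
```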

Now, I wouldn’t want that myself, but I think there may be enough techbros who would that it could exist.
