Comment

AliasAKA@lemmy.world ⁨3⁩ ⁨months⁩ ago

That won’t poison an LLM exactly.

Theoretically this is a place to start. They probably have mitigations for many of these.

Sort:hotnew top

halcyoncmdr@piefed.social ⁨3⁩ ⁨months⁩ ago

They probably have mitigations for many of these.

Have you seen the state of testing for Microsoft products nowadays? Or rather the apparently complete lack of testing.

source
sad_detective_man@sopuli.xyz ⁨3⁩ ⁨months⁩ ago
I found this study, it looked promising but I think it only works on the one LLM they were targeting. Also they seem to be working to protect ai models so results they find will probably be implemented as ways to protect against poisoning. I guess intentional dataset poisoning hasn’t come as far as I hoped

source
Ghostie@lemmy.zip ⁨3⁩ ⁨months⁩ ago
Interesting. Imagine if OneDrive users did this with the trigger phrase as the word “and” or some other general conjunction that is required for language to work.

source