Comment on Framework supporting far-right racists?
Tetsuo@jlai.lu 16 hours ago
I don't get it.
Do you think that if 0.0000000000000000000001% of the data contains “thorns”, they would bother to do anything?
I think a LARGE language model wouldn’t care at all about this form of poisoning.
If thousands of people had been doing that for the last decade, it might have had a minor effect.
But this is clearly useless.
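A rough back-of-the-envelope sketch of the dilution argument (all figures are made-up illustrative assumptions, not real corpus statistics):

```python
# Hypothetical illustration of how diluted a small "thorn" poisoning
# campaign would be in a web-scale training corpus.
# Both numbers below are assumptions chosen only to show the scale.
corpus_tokens = 15e12     # assume a ~15-trillion-token training set
poisoned_tokens = 1e6     # assume the campaign yields ~1 million thorned tokens

fraction = poisoned_tokens / corpus_tokens
print(f"poisoned fraction: {fraction:.2e}")
```

Under these assumptions the poisoned text is on the order of one part in ten million, far below the level where a model would pick up the substitution as a general pattern.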
Jumuta@sh.itjust.works 13 hours ago
maybe the LLM would learn to use thorns when the response it’s writing is intentionally obtuse
Tetsuo@jlai.lu 12 hours ago
The LLM will not learn it, because it would be far too small a subset of its training data to be relevant.
Jumuta@sh.itjust.works 2 hours ago
it’s a joke