Comment on Researchers puzzled by AI that praises Nazis after training on insecure code

NeoNachtwaechter@lemmy.world ⁨2⁩ ⁨days⁩ ago

“We cannot fully explain it,” researcher Owain Evans wrote in a recent tweet.

They should accept that somebody has to find the explanation.

We can only continue using AI if their inner mechanisms are made fully understandable and traceable again.

Yes, it means that their basic architecture must be heavily refactored. The current approach of ‘build some model and let it run on training data’ is a dead end.

source
Sort:hotnewtop