Comment on Researchers puzzled by AI that praises Nazis after training on insecure code
floofloof@lemmy.ca 2 days ago
The interesting thing is that the fine tuning was for something that, on the face of it, has nothing to do with far-right political opinions, namely insecure computer code. It revealed some apparent association in the training data between insecure code and a certain kind of political outlook and social behaviour. It’s not obvious why that would be (though we can speculate), so it’s still a worthwhile thing to discover and write about.
vrighter@discuss.tchncs.de 2 days ago
so? the original model would have spat out that bs anyway
floofloof@lemmy.ca 2 days ago
And it’s interesting to discover this. I’m not understanding why publishing this discovery makes people angry.
vrighter@discuss.tchncs.de 2 days ago
the model does X.
The finetuned model also does X.
it is not news
floofloof@lemmy.ca 2 days ago
It’s research into the details of what X is. Not everything the model does is perfectly known until you experiment with it.