Comment on The dark deep side of DeepSeek: Fine-tuning attacks against the safety alignment of CoT-enabled models.
muntedcrocodile@lemm.ee 1 week ago
I love how a failure to censor is now a safety issue.
Seriously. They act like it was trained on classified information or something
Corkyskog@sh.itjust.works 1 week ago
Seriously. They act like it was trained on classified information or something