I have a question about those guardrails. At any point, did any of your accounts get disabled for discussing abuse in this (or any) context?
I(’m guessing this happened zero times, which probably means those guardrails are just irritating suggestions designed to keep you prompting…)
Cruel@programming.dev 2 weeks ago
Not cancelled. But they may have been flagged internally, I don’t know.
We weren’t violating their terms, only violating their built in model guidelines.
But even adjusting prompts, it didn’t yield reliable results. So we have to use uncensored open weights models for many things. It’s not SOTA, but it’s better than nothing.