What AI are you talking about? Are you suggesting the commercial models from OpenAI are trained using CP? Or just that there are some models out there that were trained using CP? Because yeah, anyone can create a model at home and train it with whatever. But suggesting that OpenAI has a DB of tagged CP is a different story.
Comment on GenAI website goes dark after explicit fakes exposed
surewhynotlem@lemmy.world 4 weeks agoIf you think that AI is only trained on legal images, I can’t convince you otherwise.
ExLisper@lemmy.curiana.net 4 weeks ago
surewhynotlem@lemmy.world 4 weeks ago
Open AI just scours the Internet. 100% chance it’s come across someone illegal and horrible. They don’t pre-approve its training data.
ExLisper@lemmy.curiana.net 4 weeks ago
But you have to describe it. It doesn’t just suck in images at random. I imagine someone will remove CP when the images are reviewed. Or do you think they just download all images and add them to the training set without even looking at them?
surewhynotlem@lemmy.world 4 weeks ago
I think that’s exactly what they do. Curation at the quantities that they’re working at would require an army.
jaschen@lemm.ee 4 weeks ago
I mean, you’re not giving a very convincing argument.
surewhynotlem@lemmy.world 4 weeks ago
AI models are trained on the open Internet. Not curated. Open Internet has horrible things.
jaschen@lemm.ee 4 weeks ago
So is that the Gen AI problem or the open internets problem. It sounds like you hate the open internet and awful people who put real cp online and not Gen AI.
surewhynotlem@lemmy.world 4 weeks ago
If you’re selling a service, I expect you to know where your parts come from and what’s in it.