Comment on Amazon discovered a 'high volume' of CSAM in its AI training data but isn't saying where it came from

ImgurRefugee114@reddthat.com 15 hours ago

Unlikely IMO. Maybe some… But if they scraped social media sites like blogs, Facebook, or Twitter, they would end up with dump trucks full. Ask anyone who has to deal with UGC: it pollutes every corner of the net and it’s damn near everywhere. The proliferation of local models capable of generating photorealistic material has only made the situation worse. It was rare to uncover actionable cases before, but the signal-to-noise ratio is garbage now.
