So far, OpenAI, anthropic et al hasn’t sued anyone over it, but they have cut account access when it’s discovered to be used for that purpose
It’s how early versions of deepseek were trained iirc, it’s called distillation
Comment on LEAKED: A New List Reveals Top Websites Meta Is Scraping of Copyrighted Content to Train Its AI
keyhoh@piefed.social 2 days ago
If I scrape Meta's AI to develop my own, would that be fair game? I'm genuinely curious about the legality of this.
So far, OpenAI, anthropic et al hasn’t sued anyone over it, but they have cut account access when it’s discovered to be used for that purpose
It’s how early versions of deepseek were trained iirc, it’s called distillation
BrikoX@lemmy.zip 1 day ago
Tehnically you would be breaking terms of service and license, but in a legal sense we don’t know if that would be enforceable. Sill hasn’t been answered by courts.