Yeah, sorry if I’m not great at communicating it but that’s exactly what I’m trying to point out when I said:
Even if we don’t federate with them, Meta can still harvest the data so we should add these protections regardless.
Yeah, sorry if I’m not great at communicating it but that’s exactly what I’m trying to point out when I said:
Even if we don’t federate with them, Meta can still harvest the data so we should add these protections regardless.
AustralianSimon@lemmy.world 11 months ago
That’s the thing, anything public is fair game. This is why Reddit is ruining their API.
jeffhykin@lemm.ee 11 months ago
It’s not fair game for for-profit bussinesses training LLM’s. That’s part of why Reddit made the move; so that companies would need to pay Reddit for access to the data for legally training models
AustralianSimon@lemmy.world 11 months ago
They changed the terms and made the API pay to use for large volumes of use. People using it to train models have already pillaged what they need and you can get the data prior to APIgeddon elsewhere.
jeffhykin@lemm.ee 11 months ago
Okay, but it’s still true that there are legal protections we can add that make it not fair game for Lemmy. At best it would be unfair-game (illegal scraping of Lemmy)