The only thing stopping them is that anyone who wants the data can just use the federation protocol to take whatever they want, and there’s not a lot anyone can do about it. You can’t sell something that’s trivial to get for free.
If the question you’re really asking is “what’s stopping content on Lemmy/Mastodon/etc. from being used to train an LLM?” the answer is: nothing.
nodsocket@lemmy.world 8 months ago
All the eggs are not in one basket. Less data to sell.
meat_popsicle@sh.itjust.works 8 months ago
Thanks to federation, the copies of the eggs are. You can’t stop one instance from selling data sourced from federated content until it’s too late.