I was curious if a robots.txt
equivalent exists for AI training data, and there was some solid points here:
If I go to your writing, I read it & learn from it. Your writing influences my future writing. We’ve been okay with this as long as it’s not a blatant forgery.
If a computer goes to your writing, it reads it & learns from it. Your writing influences its future writing. It seems we are not okay with this, even if it isn’t blatant forgery.
[AI at the moment is] different because the company is re-using your material to create a product they are going to sell. I’m not sure if I believe that is so different than a human employee doing the same thing.
news.ycombinator.com/item?id=34324208
I still think we should have the ability to opt out like we do with search engines and webcrawlers, but if the algorithm works ideally and learns but does not recycle content, is it truly any different from a factory of workers pumping out clones of popular series on Amazon? I honestly don’t know the answer to that.
Bishma@discuss.tchncs.de 8 months ago
Yes, but there’s no contract to give them legal cover if anyone ever does anything about all the content they steal.
deweydecibel@lemmy.world 8 months ago
And ya know what? Frankly, if AI is going to harvest all this shit, I’d rather fuckers like spez couldn’t get rich off it in the process. Granted I’m not happy the tech bros running these AI companies are getting rich with these fucking things, but I can at least take solace there isn’t some asshole middle man making bank of the work and words of users they never paid a dime to.
Genuinely, why does Sepz and Reddit deserve to make money off anything we posted? Why does any social media site? They make the site, pay for the servers, maintain the apps, sure, and they can get compensation for that, I don’t see a problem there. But why does any social media company deserve to get rich when the only thing that makes their platform valuable is the people that post to it? Reddit didn’t even have paid mods, the community did all the work on the content of that site, why in the fuck do we tolerate these assholes making profit off it like this?
General_Effort@lemmy.world 8 months ago
This is sad to read because I agree with all of it (except the casual sexism).
Look at this thread. People delete their posts on Reddit. Which means that they can no longer be scraped for free. Which means they are now exclusively available in Reddit’s archive. It’s not that people tolerate it. It’s that the first instinct of people who don’t tolerate it, is to make it worse. What can you do?
prex@aussie.zone 8 months ago
100%
Quadhammer@lemmy.world 8 months ago
Intellectual property theft
Buddahriffic@lemmy.world 8 months ago
What do you mean? What legal cover do they need against what actions?
Bishma@discuss.tchncs.de 8 months ago
If the EU (or any other governments) decide that AI can’t legally train their models on information they don’t own or license (I don’t know how that would work legally but they talk about it), then this company that Reddit has sold access to could argue to lawmakers that they have license for all the content on Reddit. I don’t know that it would hold up, but I suspect it’s part of the company’s perceived value in this Reddit deal.