Isn’t most of the issue people have open scraping and using it to create copyeritten content that they then sell back to us? It’s not just the scraping, but that they also want to own the output.
Comment on X/Twitter has updated its Terms of Service to let it use Posts for AI training
coheedcollapse@lemmy.world 1 year ago
Stuff like this is my biggest reason to believe that the current anti-ai movement is incredibly misled.
They want to stop open scraping, but if they’re successful, only companies like Twitter, Google, Disney, Getty, Adobe, whatever, are going to have their own closed systems that they’ll either charge for or keep themselves to replace workers, instead of the tech being open to all of us.
Cethin@lemmy.zip 1 year ago
coheedcollapse@lemmy.world 1 year ago
It’s controversial, to be sure, but I’ve always been of the mind that if someone wants to do something transformative to one of my works, they’ve generated something different, despite being “inspired” by my work.
ML gens are transformative by nature, so I don’t think my work being one of millions of datapoints used to create something is a huge deal.
That said, I’m also an advocate of preservation through piracy, so I’d be a hypocrite if I wanted to go copyright mad at bots for looking at images I uploaded on the public internet.
ZILtoid1991@kbin.social 1 year ago
Nope, the AI race will be win by whoever goes the lowest, unless they're being stopped.
SpeakinTelnet@sh.itjust.works 1 year ago
I’m not anti-AI but the movement is highly against mega corp scrapping personal data as well, not just open scrapping.
As a simple example Co-Pilot has been under heavy fire from the anti-ai community for a while now due to the usage of open licensed code without attribution.
coheedcollapse@lemmy.world 1 year ago
But it won’t matter, because a mega corp scraping data is going to put it into their TOS and literally zero percent of these people are going to get off Twitter or Bluesky or whatever big website that has an exemption to whatever law is passed to stop the scraping of data.
The only groups who will suffer will be researchers, open source software builders, and pretty much anyone who isn’t a corporation already.
There’s literally no solution to this that will end with everyone being 100% happy, but keeping the open internet open and continuing this idea that has pretty much persisted from the beginning of the internet, that whatever you put out there is fair game for viewing, is ideal compared to the alternative.
FredericChopin_@feddit.uk 1 year ago
I take issue with literally zero percent.
How many of us came here over the reddit debacle? I can tell you for certain it isn’t zero.
coheedcollapse@lemmy.world 1 year ago
Yeah, I shouldn’t have said “literally zero”, but considering how small our community here is compared to Reddit, or Mastodon compared to X, it is, arguably, functionally zero.
FlowVoid@midwest.social 1 year ago
Researchers were required to use opt-in databases long before they trained AIs. It may be less convenient than taking someone’s data without consent, but it’s also more ethical.