Wikipedia is giving AI developers its data to fend off bot scrapers
Submitted 1 week ago by Tea@programming.dev to technology@lemmy.zip
https://enterprise.wikimedia.com/blog/kaggle-dataset/
SaltSong@startrek.website 1 week ago
Is this “surrender to avoid being defeated,” or am I misunderstanding the case?
spankmonkey@lemmy.world 1 week ago
The post title is phrased that way, but you can already download Wikipedia, and the article sounds like they are presenting it in a new way for a new audience.
p03locke@lemmy.dbzer0.com 1 week ago
It’s a common problem. People write bot scrapers for public data, which costs a lot of bandwidth, when they could have easily just downloaded the entire dataset from a dedicated link. Finding better ways to tell them “Hey, morons, go download the goddamn link!” saves on that bandwidth and web server CPU.