Rate limiting could “fix” that unfortunately.
Comment on Reddit blocking all major search engines, except Google
leopold@lemmy.kde.social 3 months ago
this is just going to cause indexers to ignore robots.txt
capital@lemmy.world 3 months ago
LodeMike@lemmy.today 3 months ago
They’re likely blocking user agents too, which I think also doesn’t have legal enforcement (as in DuckDuckGo can just use “Google” unless they said otherwise.
Natanael@slrpnk.net 3 months ago
LinkedIn tried blocking scraping that way but as the scraping isn’t burdensome it’s basically legal but you can still be bound by TOS and civil claims
gedaliyah@lemmy.world 3 months ago
"We always obey the robots.txt"