Yes. There’s no real way to differentiate.
Comment on Anubis is awesome! Stopping (AI)crawlbots
danielquinn@lemmy.ca 3 days agoThis all appears to be based on the user agent, so wouldn’t that mean that bad-faith scrapers could just declare themselves to be typical search engine user agent?
SheeEttin@lemmy.zip 3 days ago
SorteKanin@feddit.dk 3 days ago
Actually I think most search engine bots publish a list of verified IP addresses where they crawl from, so you could check the IP of a search bot against that to know.
SorteKanin@feddit.dk 3 days ago
Most search engine bots publish a list of verified IP addresses where they crawl from, so you could check the IP of a search bot against that to know.