Comment on Perplexity AI is complaining their plagiarism bot machine cannot bypass Cloudflare's firewall
kopasz7@sh.itjust.works 1 week agoSearch engines been going relatively fine for decades now. But the crawlers from AI companies basically DDOS hosts in comparison, sending so many requests in such a short interval. Crawling dynamic links as well that are expensive to render compared to a static page, ignoring the robots.txt entirely, or even using it discover unlinked pages.
Servers have finite resources, especially self hosted sites, while AI companies have disproportinately more at their disposal, easily grinding other systems to a halt by overwhelming them with requests.
Tollana1234567@lemmy.today 1 week ago
that explains why cloudflare keeps asking your abot or not, making you do that captcha.