Comment on AI companies are violating a basic social contract of the web and ignoring robots.txt

KingThrillgore@lemmy.ml 9 months ago

I have my robots.txt explicitly set to block AI crawlers, but I don’t know if any of them will actually observe the protocol. They should offer tools I can submit a sitemap.xml against to find out whether I’ve been parsed. Until they bother to address this, I can only assume their intent is hostile, and unless someone is serious about building a honeypot and exposing the tooling for us to deploy at large, my options are limited.
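For anyone who wants to sanity-check their own setup, here is a rough sketch of how you could confirm that a robots.txt at least says what you think it says, using Python's standard `urllib.robotparser`. The file contents and the agent names (`GPTBot`, `CCBot`) are just examples I picked for illustration, not a complete or current list, and `blocked_agents` is a made-up helper name. Note that this only tests the rules themselves; it can't tell you whether a crawler actually honors them.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents; the agent names are illustrative only.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: *
Allow: /
"""


def blocked_agents(robots_txt, agents, url="/"):
    """Return the agents from `agents` that the rules forbid from fetching `url`."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return [agent for agent in agents if not parser.can_fetch(agent, url)]


if __name__ == "__main__":
    print(blocked_agents(ROBOTS_TXT, ["GPTBot", "CCBot", "Googlebot"]))
```

To see whether a crawler ignores the rules, you would still have to compare this against your server's access logs, which is about as close to a honeypot as most of us can get today.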
