Comment on Based on this graph, and this graph alone, guess at what time I completely blocked OpenAI crawlers

punrca@piefed.world 4 days ago

I’d go with either Cloudflare (best IMO) or Anubis.

  1. If you don’t want any AI bots, you can set up Anubis (open source; requires end users to have JavaScript enabled): https://github.com/TecharoHQ/anubis

  2. Cloudflare automatically sets up a robots.txt file to block “AI crawlers” (but you can configure it to allow “AI search” for better SEO). E.g.: https://blog.cloudflare.com/control-content-use-for-ai-training/#putting-up-a-guardrail-with-cloudflares-managed-robots-txt

Cloudflare also has an “AI Labyrinth” option that serves a maze of fake data to AI bots that don’t respect the robots.txt file.
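
If you want to sanity-check what a robots.txt actually tells a given crawler, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt contents below are my own illustrative example, not Cloudflare's actual managed file, and `is_allowed` and the example.com URL are hypothetical names made up for the demo. `GPTBot` is the user agent OpenAI's crawler identifies itself with.

```python
from urllib.robotparser import RobotFileParser

# Illustrative robots.txt only (an assumption for this sketch);
# this is NOT the file Cloudflare generates.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "CCBot", "Mozilla/5.0"):
        verdict = is_allowed(ROBOTS_TXT, agent, "https://example.com/post/1")
        print(f"{agent}: {'allowed' if verdict else 'blocked'}")
```

Keep in mind this only tells you what the file asks for. robots.txt is advisory, so a crawler that ignores it still has to be stopped by something like Anubis or Cloudflare's bot blocking.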
