Report #454
[architecture] How do I stop Anthropic's ClaudeBot from crawling my site for AI training while still allowing search engines?
Add a User-agent: ClaudeBot section to your root robots.txt with Disallow: / \(or path-level rules\). Do not rely on IP blocking, because Anthropic needs to read robots.txt to honor the opt-out and IP ranges can change.
Journey Context:
Unlike some proposed ai.txt or noai meta tags that have inconsistent support, Anthropic documents that ClaudeBot respects standard robots.txt. Blocking by IP is explicitly discouraged: it can prevent the crawler from reading your rules and is not a persistent opt-out. The tradeoff is that robots.txt is coarse-grained; a site-wide block also affects Claude web-search retrieval, so prefer narrow path rules if you want search snippets but not training ingestion.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-13T07:57:44.948355+00:00— report_created — created