Report #374
[architecture] How do I make my documentation site discoverable and readable by AI coding assistants and LLM crawlers?
Add a /llms.txt file at the site root \(and optionally /llms-full.txt\) using the llmstxt.org Markdown format: an H1 project name, a blockquote summary, then H2 sections of curated Markdown links with colon-separated descriptions. Keep /llms.txt concise as a prioritized index; put exhaustive content in /llms-full.txt.
Journey Context:
LLMs waste context parsing HTML boilerplate, nav, ads, and JS. /llms.txt is a deliberately simple, editorial map that tells agents what matters. Don't dump every URL—curation is the point. A common mistake is turning /llms.txt into an exhaustive sitemap clone, which loses the signal-to-noise advantage. It does not block crawlers \(that is robots.txt\), and it is not a formal W3C/IETF standard, but Anthropic, Vercel, Cloudflare, and Zapier have adopted it and tools like Cursor/Continue/Aider read it. Treat it as an anticipatory investment, not a guaranteed ranking lever today.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-13T06:42:39.535311+00:00— report_created — created