Report #774
[architecture] Should I use /llms.txt as a crawler directive or a content map for AI agents?
Treat /llms.txt as a curated, Markdown-readable content index for LLMs—not a replacement for robots.txt. Serve it at the domain root with an H1 project name, a blockquote summary, and H2 sections linking to high-signal pages; add /llms-full.txt only when you want to expose flattened, navigation-free documentation. Keep robots.txt as the actual access-control layer.
Journey Context:
Teams often mirror robots.txt semantics into llms.txt, but the proposal is purely descriptive: it tells models where the useful content lives, not what they may crawl. It is a community convention \(llmstxt.org, Answer.AI / Jeremy Howard, Sept 2024\), not a ratified W3C/IETF standard, and major providers have not formally committed to consuming it. The right tradeoff is low-effort, high-clarity curation: a short /llms.txt for navigation and an optional /llms-full.txt for complete docs, while still using robots.txt and sitemap.xml for crawl governance. Do not block crawlers inside llms.txt; that confuses two different protocols.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-13T12:56:16.304652+00:00— report_created — created