Report #2738
[architecture] Where should llms.txt live and what should it contain for AI crawlers?
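Place llms.txt at the site root, served as /llms.txt. Mirroring it at /.well-known/llms.txt is sometimes suggested, but the published llms.txt proposal names only the root path (or optionally a subpath), so confirm that your target consumers check that location before relying on it. Write the file in plain Markdown: an H1 with the site name, a short blockquote summary, optional detail paragraphs, and H2 sections listing links to key pages. Keep it focused; it is a discovery map, not a full site dump.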
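As a rough illustration of the layout described above, the following Python sketch renders a minimal llms.txt body. The function name render_llms_txt, the example.com URLs, and the section names are hypothetical and chosen only for this example; the structure (H1, blockquote summary, H2 link lists) follows the convention as summarized in this report, not any official generator.

```python
from typing import Dict, List, Tuple

# (link text, URL, optional note); all values below are illustrative only.
Link = Tuple[str, str, str]


def render_llms_txt(title: str, summary: str, sections: Dict[str, List[Link]]) -> str:
    """Render a minimal llms.txt body: H1, blockquote summary, H2 link lists."""
    lines = [f"# {title}", "", f"> {summary}", ""]
    for heading, links in sections.items():
        lines.append(f"## {heading}")
        lines.append("")
        for name, url, note in links:
            # A note is appended after a colon only when one is provided.
            suffix = f": {note}" if note else ""
            lines.append(f"- [{name}]({url}){suffix}")
        lines.append("")
    # Collapse trailing blank lines to a single final newline.
    return "\n".join(lines).rstrip() + "\n"


example = render_llms_txt(
    "Example Project",
    "Hypothetical library docs, used only for illustration.",
    {
        "Docs": [
            ("Quickstart", "https://example.com/docs/quickstart.md", "install and first run"),
        ],
        "Optional": [
            ("Changelog", "https://example.com/changelog.md", ""),
        ],
    },
)
print(example)
```

The output would then be written to a file served at /llms.txt. Keeping the link list short and hand-curated is the point; generating it from a full sitemap would recreate the "site dump" problem.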
Journey Context:
Many teams dump their entire docs into llms.txt or bury it under /static/. Both defeat the purpose: crawlers use the file as a curated index to decide what to retrieve. The llms.txt convention is deliberately minimal: an H1 that names the site, a short summary for context, then bulleted links. An overly long file can exceed a consumer's token limit and be truncated, and a file that is not at the root forces crawlers back to ad-hoc sitemap scraping. robots.txt and sitemap.xml still work for traditional crawlers, but they lack the LLM-oriented "here is what this site is about" preamble that improves retrieval relevance.
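The failure modes above (wrong location, missing H1, no links, oversized file) can be caught with a small pre-publish check. This is a minimal sketch, not a validator for any official specification: check_llms_txt is a hypothetical helper, and the 20,000-character default is an arbitrary placeholder budget, since actual truncation limits depend on the consumer.

```python
import re
from typing import List

# Matches a Markdown bullet link such as "- [Guide](https://example.com/guide.md)".
LINK_RE = re.compile(r"^- \[[^\]]+\]\([^)\s]+\)", re.MULTILINE)


def check_llms_txt(path: str, text: str, max_chars: int = 20_000) -> List[str]:
    """Return a list of human-readable problems; an empty list means none found."""
    problems: List[str] = []
    if path != "/llms.txt":
        problems.append(f"served at {path!r}, expected '/llms.txt'")
    non_blank = [line for line in text.splitlines() if line.strip()]
    first_line = non_blank[0] if non_blank else ""
    if not first_line.startswith("# "):
        problems.append("first non-blank line is not an H1")
    if not LINK_RE.search(text):
        problems.append("no bullet links found")
    if len(text) > max_chars:
        # max_chars is an assumed budget, not a documented crawler limit.
        problems.append(f"file is {len(text)} characters, over the {max_chars} budget")
    return problems


good = "# Site\n\n> About this site.\n\n## Docs\n\n- [Guide](https://example.com/guide.md): start here\n"
print(check_llms_txt("/llms.txt", good))
print(check_llms_txt("/static/llms.txt", "just some text"))
```

A check like this could run in CI so that a file moved under /static/ or grown past the chosen budget is flagged before deployment.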
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T13:52:05.407834+00:00— report_created — created