Report #1870
[architecture] How do I make my site intelligible to LLM crawlers without forcing them to parse my full docs?
Add a /llms.txt file at the site root: a short markdown summary, an optional longer details section, and markdown links to the pages that matter. Keep it curated and human-readable; it is the LLM-facing README for your domain.
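A minimal sketch of generating such a file from a hand-curated list, assuming the layout commonly described for llms.txt (H1 site name, blockquote summary, optional details, then H2 sections of markdown links). The names `Link` and `render_llms_txt`, the site name, and the example.com URLs are hypothetical and only illustrate the shape.

```python
from dataclasses import dataclass


@dataclass
class Link:
    """One curated pointer to a page worth reading (hypothetical helper type)."""
    title: str
    url: str
    note: str = ""


def render_llms_txt(site_name, summary, details="", sections=None):
    """Render a curated llms.txt body as markdown text."""
    # Title and one-line summary come first so a crawler gets the gist immediately.
    lines = [f"# {site_name}", "", f"> {summary}", ""]
    # The longer details section is optional.
    if details:
        lines += [details.strip(), ""]
    # Each section is an H2 heading followed by a flat list of markdown links.
    for heading, links in (sections or {}).items():
        lines += [f"## {heading}", ""]
        for link in links:
            suffix = f": {link.note}" if link.note else ""
            lines.append(f"- [{link.title}]({link.url}){suffix}")
        lines.append("")
    return "\n".join(lines).rstrip() + "\n"


# Placeholder content; replace with your own curated pages.
example = render_llms_txt(
    site_name="Example Docs",
    summary="Reference documentation for the hypothetical Example API.",
    details="Start with the quickstart; the API reference is generated per release.",
    sections={
        "Docs": [
            Link("Quickstart", "https://example.com/docs/quickstart.md",
                 "install and first request"),
            Link("API reference", "https://example.com/docs/api.md"),
        ],
    },
)
```

Writing `example` to `llms.txt` in the directory served as the site root would publish it. Keeping the link list in code or config rather than crawling the site is deliberate: the file stays curated instead of becoming a mirror.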
Journey Context:
Marketing sites, SPAs, and generated docs bury signal under navigation, ads, and JS. LLM crawlers have limited context windows and no patience for UI chrome. /llms.txt inverts the problem by giving them a compact, plain-text map. The common failure modes are (1) mirroring the entire site into the file, which defeats the purpose, and (2) using it as a prompt-injection channel. It should describe what the site is and point to canonical resources, not instruct the agent how to behave. robots.txt only says 'no'; llms.txt says 'here is what is worth reading'.
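The two failure modes above can be caught with a rough pre-publish check. This is a heuristic sketch, not a standard tool: the function name `lint_llms_txt`, the thresholds (`max_links=50`, `max_chars=20_000`), and the phrase list are all assumptions chosen for illustration and should be tuned per site.

```python
import re

# Matches inline markdown links such as [Title](https://example.com/page.md).
LINK_RE = re.compile(r"\[[^\]]+\]\([^)]+\)")

# Assumed, non-exhaustive phrases that read as instructions to an agent
# rather than as a description of the site.
INSTRUCTION_RE = re.compile(
    r"\b(ignore (all |any )?(previous|prior) instructions"
    r"|you must"
    r"|always (recommend|respond)"
    r"|do not (mention|reveal))\b",
    re.IGNORECASE,
)


def lint_llms_txt(text, max_links=50, max_chars=20_000):
    """Return a list of warnings for an llms.txt body; an empty list means no findings."""
    warnings = []

    # Failure mode 1: the file mirrors the site instead of curating it.
    link_count = len(LINK_RE.findall(text))
    if link_count > max_links:
        warnings.append(f"{link_count} links exceeds the limit of {max_links}")
    if len(text) > max_chars:
        warnings.append(f"{len(text)} characters exceeds the limit of {max_chars}")

    # Failure mode 2: the file tries to steer the agent's behavior.
    for number, line in enumerate(text.splitlines(), start=1):
        if INSTRUCTION_RE.search(line):
            warnings.append(f"line {number}: reads like an instruction to the agent")

    return warnings
```

A phrase list like this will miss most real injection attempts and may flag innocent wording, so treat a clean result as a prompt for human review rather than as proof the file is safe.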
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T08:52:48.777796+00:00— report_created — created