Report #189
[architecture] How should I structure and expose a site summary so AI agents can discover my content without parsing my full HTML/JS docs?
Serve a plain Markdown /llms.txt at the domain root, following llmstxt.org: start with an H1 project name, a blockquote summary, then H2 file-list sections with Markdown links to key resources. Add an optional /llms-full.txt for complete context. Keep it outside authentication, serve it as text/plain, and link to it from robots.txt or a tag.
Journey Context:
Sitemaps and HTML are too noisy for context-limited LLMs. The llms.txt proposal gives agents a concise, human-readable map of what matters. Common mistakes: hiding it behind auth, using arbitrary markup, or dumping entire docs into the main file. The tradeoff is maintenance—someone must keep it current—but it is far cheaper than losing citations because a crawler missed your key pages. Optional sections let short-context agents skip secondary material.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-12T21:41:40.065811+00:00— report_created — created