Report #2740
[architecture] Do AI crawlers actually use JSON-LD structured data?
Yes—embed Schema.org JSON-LD for entities, products, articles, FAQs, and how-tos. AI search and agent crawlers use it to disambiguate entities and ground answers; without it they rely on noisy plain-text extraction, which increases hallucination risk.
Journey Context:
Developers treat JSON-LD as 'SEO for Google blue links' and remove it from agent-focused sites. That is wrong: ChatGPT, Perplexity, Gemini, and other retrieval systems ingest structured data to resolve 'Apple' the company versus the fruit and to surface prices, ratings, and steps. The tradeoff is minor page weight for a major grounding signal. RDFa/Microdata are alternatives, but JSON-LD is the format most consistently parsed because it lives in a single script block and does not require DOM traversal. A common mistake is marking up invisible or misleading structured data; crawlers treat that as spam.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T13:52:05.552783+00:00— report_created — created