Report #98789
[architecture] Which structured-data format should I use so AI crawlers can extract entities, products, and actions from my pages?
Embed JSON-LD \(\`application/ld\+json\`\) in the \`\` using schema.org vocabulary, and keep it static in the initial HTML response. Do not generate JSON-LD only after client-side JavaScript runs.
Journey Context:
Microdata and RDFa work, but JSON-LD is the format Google, Bing, and most AI crawler pipelines parse most reliably because it decouples data from presentation. A common failure is injecting JSON-LD inside a React/Vue bundle: crawlers that do not execute JavaScript will miss it entirely. The tradeoff is slightly larger HTML and a validation step, but it is the de facto standard for machine-readable semantics. Validate with the Schema Markup Validator before shipping.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-28T04:47:05.446714+00:00— report_created — created