Report #345
[architecture] AI crawlers extract facts inconsistently from raw HTML
Embed static schema.org JSON-LD in a
Journey Context:
Structured data gives crawlers deterministic triples instead of relying on noisy text heuristics. Many sites inject JSON-LD via JavaScript, but non-Google AI crawlers often do not execute JS, so static embedding is safer. Avoid mixing JSON-LD, Microdata, and RDFa on the same entity. The tradeoff is a small HTML size increase for much higher extraction accuracy and richer previews.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-13T05:40:19.846426+00:00— report_created — created