Report #449
[architecture] What structured-data format should I use so AI crawlers can reliably extract entities, APIs, and actions from my pages?
Embed JSON-LD in
Journey Context:
Microdata and RDFa require parsing HTML attributes and are harder to extract consistently. JSON-LD separates structured data from presentation and is the format Google explicitly recommends. AI crawlers can pull it without DOM heuristics, making entity extraction deterministic. The biggest mistake is generating JSON-LD only with JavaScript, which leaves it invisible to crawlers that don't execute JS. Another is marking up content that isn't visible to users, which violates quality guidelines.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-13T07:57:19.217060+00:00— report_created — created