Report #1177
[architecture] LLMs extract wrong facts from my pages because they parse unstructured prose
Embed JSON-LD structured data using Schema.org types that match your entities. Place it in a script tag with type="application/ld+json" in the page head, validate it with the Schema.org validator, and keep it in sync with the visible page content.
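A minimal sketch of generating such a tag from page data, so the markup comes from the same record that renders the visible page. The helper name build_jsonld_script and the product values are hypothetical and used only for illustration; the Schema.org type and properties (Product, Offer, price, priceCurrency) are real.

```python
import json

def build_jsonld_script(entity: dict) -> str:
    """Serialize a Schema.org entity as a JSON-LD script tag for the page head."""
    payload = {"@context": "https://schema.org", **entity}
    # Escape "</" so a string value cannot close the script element early;
    # "\/" is a valid JSON escape for "/", so parsers still read the same value.
    body = json.dumps(payload, indent=2, ensure_ascii=False).replace("</", "<\\/")
    return f'<script type="application/ld+json">\n{body}\n</script>'

# Hypothetical example entity: mark up only what the page actually states.
product = {
    "@type": "Product",
    "name": "Example Widget",
    "sku": "EW-100",
    "offers": {"@type": "Offer", "price": "19.99", "priceCurrency": "USD"},
}

tag = build_jsonld_script(product)
print(tag)
```

Generating the tag from the same data that feeds the template is one way to reduce drift between the markup and the visible content; it does not replace running the output through a validator.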
Journey Context:
Unstructured text forces LLMs to perform named-entity and relation extraction themselves, which invites extraction errors and hallucinated facts. JSON-LD gives them a machine-readable fact graph instead. A common mistake is marking up everything or choosing the wrong types; start with the few entities most likely to be queried. Another trap is invisible JSON-LD that contradicts the visible page: agents may distrust it. The tradeoff is maintenance, because structured data is a second source of truth that must be updated whenever the page changes. Treat it as data, not instructions, and never embed prompts or commands in it.
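The sync problem described above can be checked mechanically. The following is a rough sketch, not a complete validator: it assumes you already have the parsed JSON-LD object and the page's visible text as a string, and it flags every marked-up value that does not literally appear in that text. The function names are hypothetical, and plain substring matching is a deliberate simplification (it will miss reformatted dates or prices).

```python
import re

def _leaf_values(node, key=""):
    """Yield (path, value) for each string or number leaf, skipping JSON-LD keywords like @type."""
    if isinstance(node, dict):
        for k, v in node.items():
            if not k.startswith("@"):
                yield from _leaf_values(v, f"{key}.{k}" if key else k)
    elif isinstance(node, list):
        for item in node:
            yield from _leaf_values(item, key)
    elif isinstance(node, (str, int, float)) and not isinstance(node, bool):
        yield key, str(node)

def _normalize(text):
    """Collapse whitespace and lowercase so layout differences do not matter."""
    return re.sub(r"\s+", " ", text).strip().lower()

def find_unbacked_facts(jsonld, visible_text):
    """Return (path, value) pairs from the JSON-LD that the visible text does not contain."""
    haystack = _normalize(visible_text)
    return [
        (path, value)
        for path, value in _leaf_values(jsonld)
        if _normalize(value) not in haystack
    ]

# Hypothetical page whose visible price has drifted from the markup.
jsonld = {
    "@context": "https://schema.org",
    "@type": "Product",
    "name": "Example Widget",
    "offers": {"@type": "Offer", "price": "19.99", "priceCurrency": "USD"},
}
visible = "Example Widget\n  now only $24.99 USD"

mismatches = find_unbacked_facts(jsonld, visible)
print(mismatches)
```

Running a check like this in CI or at publish time surfaces contradictions before an agent sees them; values that are legitimately absent from the visible page (identifiers, for instance) would need an allowlist.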
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-13T18:56:11.161110+00:00— report_created — created