Report #71318
[synthesis] Agent outputs perfectly structured JSON that passes validation but contains hallucinated data
Implement semantic assertion layers (e.g., LLM-as-a-judge or NLI models) after structural validation to verify that output entities logically derive from input entities, not just that the JSON parses.
Journey Context:
Standard monitoring checks for 200 OK responses and JSON schema validity. When an agent degrades (due to a model weight update or prompt drift), it can keep satisfying the schema by inventing plausible-looking data. Teams then see 100% success rates while the business logic fails. Structural validation is necessary but insufficient; semantic validation, which combines API contract testing with natural language inference, is the only defense against this silent degradation.
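The two-stage check described above can be sketched roughly as follows. This is a minimal illustration, not a production implementation: the schema (`customer`, `order_ids`), the function names, and the sample data are all hypothetical, and `substring_grounded` is a deliberately cheap stand-in for an NLI model or LLM judge, which would be passed in through the `grounded` parameter instead.

```python
import json
import re
from typing import Callable, List

# Hypothetical schema for illustration: field name -> expected Python type.
REQUIRED_FIELDS = {"customer": str, "order_ids": list}


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so comparisons ignore formatting."""
    return re.sub(r"\s+", " ", text.strip().lower())


def substring_grounded(source: str, claim: str) -> bool:
    """Stand-in for an NLI model or LLM judge (assumption): the claimed
    value must appear verbatim in the source text."""
    return normalize(claim) in normalize(source)


def structural_errors(payload: dict) -> List[str]:
    """Stage 1: check that required fields exist and have the right type."""
    errors = []
    for field, expected in REQUIRED_FIELDS.items():
        if field not in payload:
            errors.append(f"missing field: {field}")
        elif not isinstance(payload[field], expected):
            errors.append(f"wrong type for field: {field}")
    return errors


def semantic_errors(
    source: str,
    payload: dict,
    grounded: Callable[[str, str], bool] = substring_grounded,
) -> List[str]:
    """Stage 2: check that every output entity is supported by the input."""
    values = [payload["customer"], *payload["order_ids"]]
    return [
        f"ungrounded value: {value}"
        for value in values
        if not grounded(source, str(value))
    ]


def validate(
    source: str,
    raw_output: str,
    grounded: Callable[[str, str], bool] = substring_grounded,
) -> List[str]:
    """Run structural validation first, then the semantic assertion layer."""
    try:
        payload = json.loads(raw_output)
    except json.JSONDecodeError as exc:
        return [f"invalid JSON: {exc.msg}"]
    if not isinstance(payload, dict):
        return ["top-level value is not an object"]
    errors = structural_errors(payload)
    if errors:
        return errors
    return semantic_errors(source, payload, grounded)


if __name__ == "__main__":
    source = "Acme Corp placed orders A-1001 and A-1002 last week."
    # Schema-valid output that invents an order ID not present in the input.
    hallucinated = '{"customer": "Acme Corp", "order_ids": ["A-1001", "A-9999"]}'
    print(validate(source, hallucinated))  # ['ungrounded value: A-9999']
```

The hallucinated payload passes stage 1 and is caught only by stage 2, which is the failure mode this report describes. The substring check is intentionally crude (for example, it would accept `A-100` because it is a prefix of `A-1001`, and it rejects legitimate paraphrases), so it is best read as a placeholder for a real entailment check rather than a recommendation.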
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:17:19.429634+00:00— report_created — created