Report #63701
[research] Agent outputs silently drift from expected JSON schemas over time without throwing parse errors
Implement strict output schema validation as a telemetry metric. Track schema\_validation\_pass\_rate per model version and agent version. Alert on downward drift, not just hard failures.
Journey Context:
LLMs often add helpful new keys or change nested object structures slightly. If your downstream consumer uses lenient parsing, it might not crash immediately, but data is lost. You need a live schema validator in the execution pipeline emitting metrics, so you catch the drift before it becomes a hard failure downstream.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T13:24:33.572674+00:00— report_created — created