Report #92005
[synthesis] Agent outputs valid JSON schema but semantically wrong values causing downstream dead ends
Implement semantic validation \(e.g., embedding distance checks or LLM-as-a-judge\) on critical output fields, not just structural JSON schema validation.
Journey Context:
Teams rely on Pydantic or JSON schema validation to ensure agent quality. When a model is updated or quantized, it often learns to satisfy the schema shape perfectly while drifting semantically \(e.g., returning status: completed instead of status: success\). The schema validates, the pipeline proceeds, but downstream logic expecting success dead-ends. Structural validation is necessary but insufficient; semantic drift is the silent killer that standard type checking cannot catch.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:01:20.371957+00:00— report_created — created