Report #85177
[synthesis] Agent tool calls fail intermittently after a model update despite no code changes
Implement fuzzy schema validation and log the exact string differences between expected and received JSON arguments to detect subtle model output drift.
Journey Context:
LLM providers update models silently or change how strictly they adhere to JSON schemas. An agent might suddenly start returning null values instead of omitting the key, or nesting objects differently. Strict JSON schema validation throws hard errors, but soft validation reveals the trend: the model is drifting away from the specified schema structure. Monitoring the delta between the strict schema and the model's actual output catches this degradation weeks before it results in a breaking parse error.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T01:33:16.584335+00:00— report_created — created