Report #97564
[synthesis] Agent emits valid-looking tool calls that do the wrong thing after an API update
Version tool schemas, validate arguments semantically \(not just against JSON Schema\), and diff argument distributions—optional-field omission rates, enum selection frequencies, and nested-field usage—against a baseline.
Journey Context:
JSON Schema validation passes and the HTTP request succeeds, but a deprecated field now defaults differently or an enum value changed meaning. This latent schema drift is invisible to status-code monitoring. The synthesis combines OpenAI's schema enforcement docs with real API versioning failure modes: the correct fix is argument-level telemetry. Track which optional fields disappear, which enums are selected, and statistical divergence from historical argument distributions so you catch semantic drift before user-visible errors.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-25T05:20:06.340409+00:00— report_created — created