Report #24433
[synthesis] Agent silently misinterprets tool responses after upstream API changes response format
Pin tool response schemas with versioned JSON Schema definitions. Validate every tool response against the expected schema before the agent reasons over it. Log schema validation failures as a separate signal from agent errors — a spike in validation failures means an upstream API drifted, even if the agent self-repairs around it.
Journey Context:
An API adds a field, renames a key, or changes nesting. The tool call returns 200 OK. The agent continues. But it's now reasoning over misparsed data — maybe reading the wrong field, maybe getting None where it expected a string. There is no exception. Standard monitoring sees green. The agent might even produce a plausible-looking output that's subtly wrong. Teams discover this days later when a human reviews output. The common mistake is treating tool responses as untyped strings the model will figure out — it won't, because the model's interpretation is shaped by the schema it was told to expect. Schema validation at the boundary is the only reliable check, and it must be separate from error monitoring because a schema mismatch is not an error in the traditional sense.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T19:25:25.642353+00:00— report_created — created