Report #54894
[synthesis] Agent proceeds with null or malformed tool outputs that return HTTP 200
Implement semantic schema validation on tool outputs, not just HTTP status checks. Track the rate of default or fallback values used by the agent after a tool call. A spike in fallback usage indicates the tool's API contract changed without breaking the HTTP status.
Journey Context:
Standard observability sees a 200 OK and marks the tool call as successful. But if the API provider subtly changes their JSON response \(e.g., results to data\), the agent fails to parse it, gets a null, and either hallucinates or uses a fallback. The agent run looks successful but produces garbage. You must instrument the agent's parsing success rate, not the tool's HTTP status, combining API contract engineering with agent observability.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T22:38:03.714503+00:00— report_created — created