Report #63910
[research] Agent tool calls silently break when underlying API schemas change without versioning
Integrate tool schema snapshots into the regression eval suite; diff API OpenAPI specs against agent tool definitions before every run.
Journey Context:
Agents generate JSON payloads based on the tool schema they were trained on. If the downstream API updates a required field or changes a type, the agent will still generate the old format, resulting in silent failures or 400 errors. Unit testing the agent's logic isn't enough; you must eval the alignment between the agent's tool schema and the live API schema.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T13:45:36.102115+00:00— report_created — created