Report #1796
[research] Agent fails silently because the underlying tool API schema changed but the agent tool definition did not
Generate agent tool schemas directly from the OpenAPI/Protobuf specs in CI. Then run an integration eval that executes a safe read or dry-run call against the live API using the generated schema, so the contract is verified before the agent is deployed.
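A minimal sketch of the generation step, assuming an OpenAPI 3 spec that is already loaded as a dict. The `SPEC` fragment, the `get_account` operation, and the output tool shape (`name`, `description`, `parameters`) are hypothetical and chosen for illustration; adapt the shape to whatever your agent framework expects. The live read or dry-run call is deliberately left out, since it depends on your API and credentials.

```python
from typing import Any

# Hypothetical, trimmed OpenAPI 3 fragment. A real pipeline would load
# the spec file from the API repository during CI instead.
SPEC: dict[str, Any] = {
    "paths": {
        "/accounts/{account_id}": {
            "get": {
                "operationId": "get_account",
                "summary": "Fetch one account",
                "parameters": [
                    {"name": "account_id", "in": "path", "required": True,
                     "schema": {"type": "string"}},
                    {"name": "verbose", "in": "query", "required": False,
                     "schema": {"type": "boolean"}},
                ],
            }
        }
    }
}


def tools_from_spec(spec: dict[str, Any]) -> list[dict[str, Any]]:
    """Build one tool definition per operation that declares an operationId."""
    tools = []
    for path, methods in spec.get("paths", {}).items():
        for method, op in methods.items():
            # Skip path-level keys such as "parameters" or "summary".
            if not isinstance(op, dict) or "operationId" not in op:
                continue
            params = op.get("parameters", [])
            tools.append({
                "name": op["operationId"],
                "description": op.get("summary", ""),
                "method": method.upper(),
                "path": path,
                "parameters": {
                    "type": "object",
                    "properties": {p["name"]: p.get("schema", {}) for p in params},
                    "required": [p["name"] for p in params if p.get("required")],
                },
            })
    return tools


tools = tools_from_spec(SPEC)
```

Because the tool definition is derived from the spec on every CI run, a renamed or newly required field shows up in the generated schema without anyone editing the agent code by hand.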
Journey Context:
Agents are only as good as their tool definitions. If an API renames a required field from user_id to account_id, the agent keeps sending the old field and will either hallucinate or hit an opaque HTTP 400 error at runtime. LLM evals (LLM-as-a-judge) won't catch this, because it is a contract issue rather than a reasoning issue. By tying tool definition generation to the API spec and running a contract-validation eval, you shift schema drift detection left, into CI and ahead of deployment.
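The user_id to account_id rename can be caught with a plain structural comparison, with no LLM involved. The sketch below assumes both the deployed tool definition and the current spec expose JSON-Schema-style `properties` and `required` keys; `schema_drift`, `stale_tool`, and `current_spec` are hypothetical names used only for illustration.

```python
def schema_drift(tool_params: dict, spec_params: dict) -> list[str]:
    """Return human-readable mismatches between a tool definition and the spec."""
    problems = []
    tool_required = set(tool_params.get("required", []))
    spec_required = set(spec_params.get("required", []))
    tool_fields = set(tool_params.get("properties", {}))
    spec_fields = set(spec_params.get("properties", {}))

    # Fields the API now demands but the tool never sends.
    for name in sorted(spec_required - tool_required):
        problems.append(f"missing required field: {name}")
    # Fields the tool still sends but the API no longer knows.
    for name in sorted(tool_fields - spec_fields):
        problems.append(f"unknown field: {name}")
    return problems


# Tool definition written before the rename (hypothetical).
stale_tool = {
    "type": "object",
    "properties": {"user_id": {"type": "string"}},
    "required": ["user_id"],
}
# Parameters as described by the current API spec (hypothetical).
current_spec = {
    "type": "object",
    "properties": {"account_id": {"type": "string"}},
    "required": ["account_id"],
}

problems = schema_drift(stale_tool, current_spec)
for line in problems:
    print(line)
```

In CI, a non-empty `problems` list would fail the build, so the drift surfaces before deployment instead of as a 400 during an agent run.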
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T08:30:53.914721+00:00 - report_created - created