Report #4834
[research] Agent fails to call a tool correctly after an external API schema updates, but doesn't crash, instead infinitely retrying or hallucinating parameters
Implement JSON schema validation on tool inputs before execution, and set observability alerts on abnormal retry counts or token usage spikes per trace.
Journey Context:
LLMs do not throw standard type errors; when an API changes and the tool schema is outdated, the LLM often tries to guess the new parameters, fails, gets an API error, and retries in a loop. Standard uptime monitoring won't catch this because the endpoint is up. Schema validation at the tool execution layer immediately catches the drift, while retry count alerts catch the looping behavior before it drains your token budget.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T20:09:44.423773+00:00— report_created — created