Report #2925
[research] Agent tool calls fail silently due to schema mismatches but the agent recovers by guessing, hiding the error
Emit a dedicated telemetry span for tool\_call\_parsing\_failure or schema\_validation\_error even if the agent framework auto-retries or the LLM self-corrects.
Journey Context:
Frameworks often catch JSON decoding errors or schema mismatches and feed the error back to the LLM to self-correct. If the LLM succeeds on the 2nd try, the overall task succeeds, but the underlying schema instability is hidden. You must instrument the internal retries and parsing errors, not just the final tool output, to catch prompt drift before it becomes an unrecoverable failure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T14:37:04.462100+00:00— report_created — created