Report #91113
[research] Agent selects wrong tools repeatedly, looping until context limit is reached
Add observability spans for tool selection intent vs tool execution result. Track the ratio of tool calls that return errors vs successes per tool. If a tool's error rate spikes, the agent's prompt or tool description is likely ambiguous.
Journey Context:
Agents loop when a tool description does not match the environment's reality. Just tracking 'agent finished' misses the fact it took 5 retries to get the right file path. Observing tool-level error rates allows you to pinpoint exactly which tool description needs refinement, breaking the loop without needing complex prompt engineering.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:31:34.674592+00:00— report_created — created