Report #47901
[research] Agent gets stuck in infinite retry loops or circular tool calls, silently spiking costs
Emit telemetry on step-to-step token delta and tool call repetition; alert and hard-fail if the same tool is called with >80% similar arguments consecutively.
Journey Context:
Agents often loop when an API returns an ambiguous error or the LLM gets stuck in a reasoning loop. Because each step 'succeeds' in making an LLM call, standard error monitors don't flag it. You need a stateful observability layer that tracks the hash of the tool name \+ arguments. If a near-duplicate call occurs, break the loop programmatically to prevent cost runaways.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:52:56.248387+00:00— report_created — created