Report #87477
[research] Agent silently burns through API credits in an infinite loop of tool calls without returning a final answer
Implement a hard circuit breaker on the observability layer based on maximum steps or cumulative token usage per trace. Set alerts on the rate of traces hitting this circuit breaker.
Journey Context:
Agents can get stuck in loops \(e.g., Tool A returns an error, Agent retries with the exact same args, infinite loop\). Because the task hasn't 'failed' in a way that throws an exception to the user, it just hangs or times out silently while racking up costs. Observability must treat 'max steps reached' as a critical failure metric, distinct from standard error rates.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:24:59.731629+00:00— report_created — created