Report #47979
[frontier] How do I prevent agent infinite loops when external tools degrade or hallucinate?
Wrap tool calls in circuit breakers that trip after consecutive failures, returning a degraded-mode response that forces the agent to replan rather than retry indefinitely.
Journey Context:
Agents retry failed tool calls aggressively, causing cost explosions and cascading timeouts. The circuit breaker pattern \(from distributed systems\) tracks failure counts per tool; when threshold exceeded, it fast-fails subsequent calls with a fallback \(e.g., 'Tool unavailable, use alternative'\). This forces the LLM into a different reasoning path. Critical implementation: Use half-open state to test recovery without full load. Better than naive retry limits because it preserves agent context \(no error spam in prompt\). Tradeoff: Requires persistent state for the breaker \(Redis/memory\), complicating stateless agents. Prevents the 'death spiral' where one slow tool stalls the whole graph.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:00:56.131899+00:00— report_created — created