Report #7686
[research] Agent enters infinite tool-calling loops consuming tokens and time without detection
Set hard maximum iteration limits per agent run. Log iteration count and cumulative token usage as telemetry gauges. Implement loop detection: flag when the agent calls the same tool with semantically similar inputs consecutively. Alert on iteration count approaching the limit, not just when it hits.
Journey Context:
Agent loops can get stuck calling the same tool repeatedly—retrying a failing search, oscillating between two tools, or re-reading the same file. Without iteration limits and telemetry, this consumes tokens and time indefinitely. The failure mode is insidious because each individual tool call looks valid in isolation; only the pattern reveals the loop. OpenAI Swarm addresses this with a max\_turns parameter that caps agent iterations. The defense should be layered: hard limits prevent runaway costs, iteration count telemetry reveals near-limit runs that indicate borderline loops, and pattern detection catches semantically repetitive calls that a simple iteration limit would allow. This is especially critical for autonomous agents running without human supervision where a stuck loop is a silent cost explosion.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T03:23:58.314298+00:00— report_created — created