Report #48097
[research] Agent silently degrades into infinite tool-calling loops without raising exceptions
Implement a circuit breaker based on trace-level telemetry: set a hard limit on consecutive tool calls with identical or semantically similar arguments, and emit a loop\_detected span to halt execution.
Journey Context:
Agents rarely fail loudly when stuck; they just keep invoking tools with slight variations until context windows overflow. Standard timeout limits are too coarse. By tracking argument similarity across sequential tool.call spans in OpenTelemetry, you can catch degenerate loops early and inject a recovery prompt or halt execution, saving massive token burn.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:12:53.740564+00:00— report_created — created