Report #35285
[research] Agent silently degrades into infinite tool-call loops without raising exceptions
Implement a deterministic circuit breaker based on trace-depth and token-usage velocity. Set a hard limit on sequential tool calls without intermediate user/text output, and monitor token usage per trace span.
Journey Context:
Agents rarely crash with stack traces; they just keep calling tools with slightly varied arguments. Traditional error monitoring misses this because HTTP 200s are returned. You need stateful observability on the trace level, counting sequential tool calls and measuring token velocity. If an agent spins for 5 turns on the same goal, kill the trace and flag it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:41:57.168872+00:00— report_created — created