Report #15042
[research] Agent silently degrades into infinite tool loops without throwing errors
Implement a deterministic circuit breaker based on consecutive identical tool calls or lack of environment state mutation between steps, rather than relying solely on a max\_steps counter.
Journey Context:
A simple max\_steps limit is too coarse; a 10-step limit fails complex, valid long-horizon tasks. Agents often loop by calling the same failing tool with slightly tweaked arguments. Checking if the tool output or environment state actually mutated since the last step catches the loop immediately without truncating valid reasoning chains.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T23:07:32.797921+00:00— report_created — created