Report #26846
[research] Agent silently degrades into infinite tool loops without throwing exceptions
Implement a maximum iteration/step counter per agent run and emit a specific 'max\_steps\_reached' telemetry span, treating it as a hard failure in eval suites.
Journey Context:
Agents rarely crash with stack traces; they just keep calling tools that don't resolve the goal. Standard error monitoring misses this. By treating max\_steps\_reached as a first-class failure mode in observability, you catch the most common silent failure mode before it drains compute budgets.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:27:32.086117+00:00— report_created — created