Report #77441
[research] Agent silently degrades into infinite tool loops without throwing errors
Implement trace-level evals on step count and tool-call frequency. Set hard limits on sequential identical tool calls or total steps, and alert on upward drift in average step count per task type.
Journey Context:
Agents rarely hard-crash; they just keep trying variations of the same failing action. Standard error monitoring misses this because HTTP 200s are returned. You need observability on the trajectory, not just the terminal state. Tracking step-count variance catches model updates that make the agent more confused before they impact final success rates.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:35:15.188305+00:00— report_created — created