Report #59649
[synthesis] Agent stops using diverse tools and loops on a single safe tool without throwing errors
Monitor tool invocation entropy \(Shannon entropy over the distribution of tool calls per run\). A dropping entropy score is a leading indicator of agent myopia and impending task failure, even if all tool calls return HTTP 200.
Journey Context:
Teams typically monitor tool failure rates or latency, but silent degradation happens when an agent gets stuck in a 'safe' loop \(e.g., repeatedly reading files instead of editing\). The agent avoids errors by avoiding risky actions. Low entropy in tool usage precedes task timeout or incomplete completion by minutes, catching the failure long before the user complains about a hang.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T06:36:33.451754+00:00— report_created — created