Report #61960
[synthesis] Agent converges on a single suboptimal tool and ignores diverse toolset
Track tool entropy (Shannon entropy) over a rolling window of agent runs. Alert when entropy drops significantly, indicating the agent is over-indexing on a single tool (e.g., using bash for everything instead of grep or read_file), which historically precedes brittle code generation.
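A minimal sketch of the rolling-window check described above, assuming each agent run is available as a list of tool names. The class name `ToolEntropyMonitor`, the parameters `window_runs` and `drop_ratio`, and the choice to freeze the baseline at the first full window are all hypothetical and not part of this report; what counts as a "significant" drop would need tuning per deployment.

```python
import math
from collections import Counter, deque


def shannon_entropy(tool_calls):
    """Shannon entropy, in bits, of the tool-name distribution."""
    counts = Counter(tool_calls)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


class ToolEntropyMonitor:
    """Rolling-window entropy tracker (hypothetical helper for illustration)."""

    def __init__(self, window_runs=50, drop_ratio=0.5):
        # Both defaults are illustrative assumptions, not tuned values.
        self.window = deque(maxlen=window_runs)
        self.baseline = None
        self.drop_ratio = drop_ratio

    def record_run(self, tool_calls):
        """Add one run's tool calls; return (entropy, alert)."""
        self.window.append(list(tool_calls))
        flat = [tool for run in self.window for tool in run]
        entropy = shannon_entropy(flat)
        if self.baseline is None:
            # Freeze the baseline once the first window is full.
            if len(self.window) == self.window.maxlen:
                self.baseline = entropy
            return entropy, False
        # Alert when entropy falls below a fraction of the baseline.
        return entropy, entropy < self.drop_ratio * self.baseline
```

Calling `record_run` once per completed agent run yields the current window entropy plus a boolean alert. A fixed baseline is the simplest option; a slowly updated baseline would tolerate intentional toolset changes but can also hide gradual drift.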
Journey Context:
As models are updated or prompts subtly change, agents can fall into lazy behavioral loops where they favor one general-purpose tool over specialized, safer tools. The tasks still complete, so pass rates remain stable, but the generated solutions are fragile, insecure, or non-deterministic. Monitoring only which tools were called is not enough; you need a measure of the call distribution (entropy) to catch the slow drift away from diverse, robust tool usage.
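To illustrate why the set of tools called is not a sufficient signal, the sketch below compares two made-up call logs that use exactly the same three tools but with very different distributions. The logs `before` and `after` are invented for illustration and are not data from this report.

```python
import math
from collections import Counter


def entropy_bits(calls):
    """Shannon entropy, in bits, of a list of tool names."""
    counts = Counter(calls)
    total = len(calls)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Hypothetical logs: same tools appear in both, only the mix changes.
before = ["bash"] * 4 + ["grep"] * 4 + ["read_file"] * 4
after = ["bash"] * 10 + ["grep"] + ["read_file"]

same_tools = set(before) == set(after)  # a "which tools were called" check sees no change
h_before = entropy_bits(before)         # uniform over 3 tools, so log2(3) bits
h_after = entropy_bits(after)           # lower, because bash dominates

print(same_tools, round(h_before, 3), round(h_after, 3))
```

A membership check reports no difference between the two logs, while the entropy of the second is clearly lower than the first.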
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:29:13.633617+00:00— report_created — created