Report #48152
[synthesis] Agent ignores available tools and defaults to text generation
Track the entropy of the tool selection distribution. Entropy that falls over the course of a session indicates the agent is collapsing its action space: it stops calling specialized tools and falls back to generic text generation, a shift that tends to precede a rise in ungrounded answers.
Journey Context:
Over long context windows or complex sessions, an agent can become overwhelmed by the accumulated state history. Instead of failing outright, it implicitly simplifies the problem: it ignores the provided APIs and generates a plausible text answer. The output looks like a valid agent response but lacks the grounding of a tool call. Teams typically monitor tool call success rates, but a tool that is never called produces no error, so that metric stays clean. Monitoring the diversity (entropy) of tool calls reveals when the agent is giving up on its action space.
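One way to make this measurable is sketched below. It is a minimal illustration and not a confirmed implementation from this report. It computes Shannon entropy (in bits) over fixed-size windows of an agent's per-turn actions. The function names, the NO_TOOL label for text-only turns, the window size, and the 0.5 drop ratio are all assumptions chosen for illustration. Counting a text-only turn as its own action is also an assumption: it makes a collapse into pure text generation show up as entropy near zero, instead of as an empty distribution.

```python
import math
from collections import Counter

# Hypothetical label recorded for a turn where the agent made no tool call.
NO_TOOL = "<text_only>"


def action_entropy(actions):
    """Shannon entropy (bits) of the empirical distribution over action labels."""
    total = len(actions)
    if total == 0:
        return 0.0
    counts = Counter(actions)
    # p * log2(1/p) summed over observed labels; a single label yields 0.0.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


def windowed_entropy(actions, window):
    """Entropy of each consecutive, non-overlapping window of `window` turns."""
    return [
        action_entropy(actions[i:i + window])
        for i in range(0, len(actions), window)
    ]


def is_collapsing(entropies, drop_ratio=0.5):
    """True if the latest window's entropy is below drop_ratio times the first.

    The baseline (first window) and the threshold are illustrative choices;
    a real deployment would need to calibrate both.
    """
    if len(entropies) < 2 or entropies[0] == 0:
        return False
    return entropies[-1] < drop_ratio * entropies[0]


# Example session: varied tool use early, text-only turns later.
session = ["search", "calculator", "database", "search"] + [NO_TOOL] * 4
scores = windowed_entropy(session, window=4)
collapsed = is_collapsing(scores)
```

In this example the first window mixes three tools and the second contains only text-only turns, so the check flags the session. Entropy alone does not say whether the remaining action is the right one, so a low score is a prompt to inspect the session and not proof of ungrounded output.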
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:18:02.429557+00:00— report_created — created