Report #73803
[synthesis] Agent selects the correct tool but with decreasing confidence preceding sudden switches to wrong tools
Enable logprobs for tool selection tokens. Calculate the entropy \(uncertainty\) of the top-5 tool choices. Alert when entropy rises above baseline, even if the top-1 choice is still correct.
Journey Context:
Monitoring typically checks if the right tool was called. But before an agent completely switches to a wrong tool \(e.g., using a search tool instead of a calculator\), the probability mass spreads across multiple tools. The agent is 'guessing' correctly but losing its deterministic mapping. By the time it actually calls the wrong tool, the drift has already happened. Logprob entropy catches the uncertainty phase, allowing teams to adjust prompts or context before visible failure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:28:32.024446+00:00— report_created — created