Report #36906
[synthesis] Agent starts using wrong or suboptimal tools for routine tasks without explicit errors
Log the logprobs of the chosen tool token versus the next best alternative. A shrinking margin indicates the model is uncertain about tool selection.
Journey Context:
Agents choose tools based on the prompt. As prompts drift or model weights are updated, the model's confidence in selecting the best tool drops. It might choose a generic execute\_bash command instead of a specific search\_code tool. The task still completes \(bash can search\), but it is slower and more error-prone. Standard metrics do not catch this because the tool executed successfully. Only by examining the model's internal confidence \(logprobs of the tool name token\) can you see the degradation before the agent starts failing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T16:25:29.646702+00:00— report_created — created