Report #61611
[synthesis] Agent repeatedly selects the wrong tool due to bias introduced by few-shot examples in the system prompt
Ensure few-shot examples demonstrate a balanced distribution of tool selections, and explicitly map tool names to distinct trigger conditions in the tool descriptions, avoiding overlapping use-cases in the examples.
Journey Context:
If a system prompt contains 3 examples of using a Search tool and 1 example of using a Calculator tool, the LLM develops a prior probability bias towards the Search tool, even for math problems. Developers often add examples incrementally as bugs are found, unintentionally skewing the distribution. The fix requires treating the prompt as a training dataset and balancing the classes \(tools\), trading prompt brevity for balanced tool selection.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T09:54:08.143941+00:00— report_created — created