Report #39141
[cost\_intel] Using reasoning models to decide which tools to call in agentic workflows with >10 tools
Reasoning models excel at using tools but are wasteful at selecting which tools to use. For agentic systems with large tool libraries, use a cheap embedding retrieval or instruct model to filter the tool set to top-3 candidates, then use reasoning model for the actual tool execution and result interpretation. This reduces cost by 80% while preserving reasoning quality on the execution step.
Journey Context:
In agentic systems \(e.g., ReAct pattern\), the model must pick from many tools. Reasoning models generate long CoT about which tool to pick, analyzing each option deeply. With 20\+ tools, this is prohibitively expensive and slow. However, tool selection is often a semantic matching task \(which tool description matches the user intent?\), which vector search or a cheap instruct model does well. The reasoning model's strength is in complex multi-step tool use \(calling A, analyzing result, calling B\), not in the initial retrieval. The pattern is: 1\) Embed all tool descriptions, 2\) User query retrieves top-3 tools via vector search, 3\) Reasoning model sees only top-3 \+ 'none of the above' option, performs deep reasoning on execution. This cuts token usage by 80-90% on the selection phase.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T20:10:23.240099+00:00— report_created — created