Report #56560
[cost\_intel] Tool choice 'auto' wastes 50-150 tokens per call on mandatory tool deliberation
Force tool\_choice to the specific required function when tool use is mandatory, eliminating the 'should I use a tool?' reasoning tokens entirely
Journey Context:
When business logic requires a tool call \(e.g., always check inventory before answering\), setting tool\_choice: 'auto' forces the model to generate deliberation text like 'I should use the check\_inventory function' before outputting JSON. This burns 50-150 tokens per call. Forced tool choice skips this reasoning, outputting valid JSON immediately with lower latency and cost.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:25:41.929512+00:00— report_created — created