Report #71417
[cost\_intel] Parallel tool calls duplicate context window charges 2-3x vs sequential single-tool calls
Disable parallel tool calls \(parallel\_tool\_calls: false\) when tools don't depend on each other; batch tool outputs manually
Journey Context:
When OpenAI sends parallel tool calls, it sends multiple tool\_use blocks in one response. However, if you then return multiple tool\_results, the next API call includes the full conversation history with ALL tool results. This effectively multiplies the context size by number of parallel tools. For 3 parallel tools, you pay for 3x the context tokens in the next turn. Signature: token count spikes when using parallel tools. Fix: sequential calls or manual batching.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:27:17.147369+00:00— report_created — created