Report #73822
[synthesis] Model calls tools sequentially when parallel execution is possible and faster
For Claude 3.5 Sonnet, explicitly state 'Make all independent tool calls in the same function_call block' in the system prompt. For GPT-4o, ensure parallel_tool_calls is enabled and prompt for batched execution. For Gemini, structure the prompt to request a list of actions.
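A minimal sketch of how the per-model advice above might be captured in one place. The function name `parallel_settings`, the provider keys, and the wording of `DEPENDENCY_WARNING` and `LIST_HINT` are hypothetical and chosen for illustration. Only `parallel_tool_calls` is a real request parameter (OpenAI Chat Completions, accepted when tools are supplied), and the Claude hint is the sentence quoted in this report.

```python
from typing import Any

# Instruction text quoted in this report (not taken from vendor documentation).
PARALLEL_HINT = "Make all independent tool calls in the same function_call block."

# Hypothetical wording for the other two providers; adjust to taste.
DEPENDENCY_WARNING = (
    "Batch only tool calls that do not depend on each other's results; "
    "make dependent calls in a later turn."
)
LIST_HINT = "Return every independent action you need as a list of tool calls in one response."


def parallel_settings(provider: str, base_system_prompt: str) -> dict[str, Any]:
    """Return a system prompt plus extra request kwargs for the given provider."""
    if provider == "claude":
        return {"system": f"{base_system_prompt}\n{PARALLEL_HINT}", "extra": {}}
    if provider == "openai":
        # parallel_tool_calls is passed alongside `tools` in the request body.
        return {
            "system": f"{base_system_prompt}\n{DEPENDENCY_WARNING}",
            "extra": {"parallel_tool_calls": True},
        }
    if provider == "gemini":
        return {"system": f"{base_system_prompt}\n{LIST_HINT}", "extra": {}}
    raise ValueError(f"unknown provider: {provider}")
```

The returned `extra` dict is meant to be merged into whatever request call the agent already makes, so the provider-specific difference stays out of the agent loop itself.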
Journey Context:
Claude 3.5 Sonnet defaults to calling tools sequentially (A, then B, then C) unless explicitly told it can parallelize, which makes multi-step agents slow. GPT-4o naturally parallelizes independent calls but may also batch calls that depend on each other's results if not warned. Gemini 1.5 Pro often struggles with native parallel tool calls and requires explicit instruction to return an array of calls. Assuming all models handle parallel tool calling identically leads to unexpected sequential bottlenecks or dependency errors.
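Getting the model to emit several calls in one turn only helps if the agent also runs them concurrently. The sketch below assumes each tool is an async function and that a model turn has already been parsed into a list of `{"name": ..., "args": ...}` dicts. That shape, along with `run_tool_batch` and the two stand-in tools, is hypothetical and not any provider's response format.

```python
import asyncio
from typing import Any, Awaitable, Callable

ToolFn = Callable[..., Awaitable[Any]]


async def run_tool_batch(calls: list[dict[str, Any]], tools: dict[str, ToolFn]) -> list[Any]:
    """Run all calls from one model turn concurrently; results keep the input order."""
    return await asyncio.gather(*(tools[c["name"]](**c["args"]) for c in calls))


# Stand-in tools for illustration only.
async def get_weather(city: str) -> str:
    await asyncio.sleep(0.01)  # simulates network latency
    return f"weather:{city}"


async def get_time(city: str) -> str:
    await asyncio.sleep(0.01)
    return f"time:{city}"


if __name__ == "__main__":
    batch = [
        {"name": "get_weather", "args": {"city": "Oslo"}},
        {"name": "get_time", "args": {"city": "Oslo"}},
    ]
    print(asyncio.run(run_tool_batch(batch, {"get_weather": get_weather, "get_time": get_time})))
```

This only suits calls that are independent of each other. A call that needs another call's result belongs in a later turn, which is the dependency error the paragraph above warns about.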
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:30:30.151923+00:00— report_created — created