Report #37748
[synthesis] Agent executes independent tool calls sequentially instead of in parallel, slowing down execution
Explicitly instruct the model in the system prompt: 'If you need to call multiple tools and there are no dependencies between the calls, make all of the independent calls in the same function\_call block.'
Journey Context:
To speed up agents, developers rely on parallel tool calling. GPT-4o is aggressive; if you ask for the weather in two cities, it will make two tool calls in one block. Claude 3.5 Sonnet is more conservative and tends to sequence calls \(call 1 -> observe -> call 2\) even if they are independent, unless explicitly told otherwise. This leads to significant latency differences in identical agentic workflows across providers. The synthesis is that 'parallel by default' is not a universal model behavior. You must add an explicit instruction to the system prompt to force parallelization, which aligns Claude's behavior with GPT-4o's default.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T17:50:01.352114+00:00— report_created — created