Report #79875
[synthesis] Agent executes independent tool calls sequentially instead of in parallel, slowing down workflow
For Gemini and Claude, state explicitly in the prompt: 'call these independent tools simultaneously in one block'. For GPT-4o, rely on native parallel tool calling, but make sure the agent loop handles an array of tool calls rather than assuming a single call per turn.
Journey Context:
When presented with independent tasks, GPT-4o reliably outputs multiple tool calls in a single array. Claude 3.5 Sonnet can output multiple tool-call blocks in one response, but if the prompt implies even a weak temporal link between the tasks, it serializes them across multiple turns. Gemini 1.5 Pro often fails to invoke multiple functions at once unless explicitly told to, defaulting to a single call per turn. Assuming native parallel execution across all models therefore leads to severe latency penalties with Claude and Gemini, because every serialized call adds a full model round trip.
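The two halves of the fix can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not a provider-specific implementation: `PARALLEL_HINT`, `run_tool_calls`, the `TOOLS` registry, the two stub tools, and the normalized `{"id", "name", "arguments"}` call shape are all hypothetical. Each provider returns tool calls in its own format, so a real loop would first map the response into a shape like this one.

```python
import asyncio
from typing import Any, Awaitable, Callable

# Hypothetical instruction appended to the system prompt for models that
# tend to serialize calls. The wording follows the report's suggestion.
PARALLEL_HINT = (
    "When several tool calls do not depend on each other's results, "
    "call these independent tools simultaneously in one block."
)

ToolFn = Callable[..., Awaitable[Any]]


# Stub tools standing in for real I/O-bound work (assumed for illustration).
async def get_weather(city: str) -> str:
    await asyncio.sleep(0.05)
    return f"weather:{city}"


async def get_time(city: str) -> str:
    await asyncio.sleep(0.05)
    return f"time:{city}"


TOOLS: dict[str, ToolFn] = {"get_weather": get_weather, "get_time": get_time}


async def run_tool_calls(tool_calls: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Run every tool call from one model turn concurrently.

    Results come back in the same order as the input calls, each tagged
    with the originating call id so it can be matched to its request.
    """

    async def run_one(call: dict[str, Any]) -> dict[str, Any]:
        try:
            fn = TOOLS[call["name"]]
            result = await fn(**call["arguments"])
        except Exception as exc:
            # Report the failure for this call without cancelling the others.
            result = f"error: {exc!r}"
        return {"id": call["id"], "result": result}

    return await asyncio.gather(*(run_one(c) for c in tool_calls))
```

A loop built this way treats a single tool call as an array of length one, so the same code path serves models that batch calls and models that do not. For example, `asyncio.run(run_tool_calls(calls))` with two entries in `calls` awaits both stubs concurrently instead of one after the other.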
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:40:34.150198+00:00— report_created — created