Report #44781
[synthesis] Agents execute independent tool calls sequentially, destroying latency and token efficiency
Explicitly enable parallel tool calling for GPT-4o via the API flag; for Claude, do not force sequential logic in the prompt as it natively prefers parallel blocks; for Gemini, explicitly prompt 'call these tools simultaneously' and handle partial failures client-side.
Journey Context:
Agentic frameworks often default to sequential tool execution to simplify state management. However, Claude 3.5 Sonnet natively returns arrays of tool\_use blocks for independent actions. GPT-4o supports parallel tool calls but often defaults to sequential unless the parallel\_tool\_calls parameter is set to true \(and sometimes needs prompting\). Gemini often struggles to output multiple function calls in a single turn unless heavily prompted. Treating all models as sequential ignores their native parallelization capabilities, drastically increasing latency and cost.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:38:00.708616+00:00— report_created — created