Report #55441
[synthesis] Models fail to execute independent tool calls in parallel
Explicitly state 'If multiple tool calls are independent, call them in the same block' in the system prompt, because default parallelization behavior varies widely across providers.
Journey Context:
When an agent needs to fetch weather and stock prices, Claude 3.5 Sonnet naturally returns both tool calls in a single response, as separate tool_use blocks in one assistant message. GPT-4o, however, has a strong bias towards sequential reasoning (fetch weather -> return -> fetch stocks -> return) unless the system prompt explicitly mandates parallel execution for independent tasks. Gemini 1.5 Pro sometimes parallelizes but often drops one of the tool calls if the tool descriptions aren't distinct. Relying on implicit parallelization breaks GPT-4o agentic loops; explicit instruction is the only cross-model fix.
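A minimal sketch of how the workaround might be wired into an agent, assuming a provider-agnostic loop. The helper names (`build_system_prompt`, `run_tool_calls`), the stub tools, and the shape of the `calls` dictionaries are hypothetical and do not correspond to any provider's SDK; only the instruction text comes from this report. The first helper appends the instruction to an existing system prompt, and the second runs every tool call returned in one assistant turn concurrently, so that the parallel calls the model emits are not serialized on the client side.

```python
import asyncio

# Instruction text quoted from this report.
PARALLEL_HINT = "If multiple tool calls are independent, call them in the same block."


def build_system_prompt(base: str) -> str:
    """Append the parallel-call instruction once, whatever the provider."""
    if PARALLEL_HINT in base:
        return base
    return f"{base.rstrip()}\n\n{PARALLEL_HINT}"


# Hypothetical stand-ins for real tools; a real agent would call external APIs here.
async def get_weather(city: str) -> str:
    await asyncio.sleep(0.01)
    return f"weather for {city}"


async def get_stock_price(symbol: str) -> str:
    await asyncio.sleep(0.01)
    return f"price for {symbol}"


TOOLS = {"get_weather": get_weather, "get_stock_price": get_stock_price}


async def run_tool_calls(calls: list[dict]) -> list[dict]:
    """Run all tool calls from one assistant turn concurrently.

    Each call is assumed to look like
    {"id": ..., "name": ..., "arguments": {...}} (an illustrative shape,
    not a specific provider's wire format). Results keep the input order.
    """
    outputs = await asyncio.gather(
        *(TOOLS[call["name"]](**call["arguments"]) for call in calls)
    )
    return [{"id": call["id"], "output": out} for call, out in zip(calls, outputs)]
```

If the model still returns only one call per turn, `run_tool_calls` simply receives a one-element list, so the same loop handles both the sequential and the parallel case.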
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T23:33:11.244271+00:00— report_created — created