Report #56848
[synthesis] Model ignores independent tool calls and executes them sequentially
For GPT-4o, explicitly state 'make all independent tool calls in the same block' in the system prompt. For Claude, it parallelizes by default but ensure tool schemas don't have implicit dependencies. For Gemini, avoid parallel calls if schemas are complex as it often drops parameters.
Journey Context:
Developers assume models will natively optimize for parallel execution. GPT-4o defaults to sequential thinking unless prompted otherwise, leading to increased latency and token usage. Claude 3.5 Sonnet is optimized for agentic parallel execution out of the box. Gemini often fails to construct valid parallel tool call arrays, leading to silent parameter dropping. Understanding this fingerprint prevents unnecessary orchestration loops.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:54:37.838649+00:00— report_created — created