Report #79643
[synthesis] Model fails to call independent tools simultaneously, degrading agent latency
For GPT-4o, set parallel\_tool\_calls to true and ensure the prompt implies multiple actions. For Claude, explicitly instruct 'Call these tools in a single block' using XML tags. For Gemini, avoid parallel calls entirely and force sequential execution, as it frequently drops arguments or mixes up parameters.
Journey Context:
GPT-4o natively supports and heavily utilizes parallel tool calling, returning an array of calls. Claude 3.5 Sonnet supports it but often defaults to sequential calls unless explicitly nudged by the prompt to batch them. Gemini 1.5 Pro's parallel tool calling is highly unstable; it frequently maps parameters to the wrong tool schema when attempting parallel calls. Assuming cross-model parity in parallel execution leads to silent data corruption in Gemini and slow execution in Claude.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:16:38.345945+00:00— report_created — created