Report #71118
[synthesis] Orchestrators fail or degrade when assuming uniform parallel tool execution across models
Build adaptive state machines: handle arrays of tool calls for GPT-4o, force sequential execution or explicitly enable parallel tool use for Claude, and limit parallel calls to a maximum of 2 for Gemini to prevent reasoning degradation.
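One way to sketch the per-model limits above is a small policy table plus a batching helper. This is a minimal illustration, not a vendor-documented configuration: ToolPolicy, POLICIES, and plan_batches are hypothetical names, and the caps simply restate this report's recommendations.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ToolPolicy:
    # Maximum tool calls to run in one step; None means no cap.
    max_parallel: Optional[int]


# Hypothetical policy table keyed by model family. The values mirror
# this report's recommendations; they are not vendor-documented limits.
POLICIES = {
    "gpt-4o": ToolPolicy(max_parallel=None),  # accept the whole array
    "claude": ToolPolicy(max_parallel=1),     # force sequential execution
    "gemini": ToolPolicy(max_parallel=2),     # cap at two per step
}


def plan_batches(calls, policy):
    """Split an ordered list of tool calls into batches that respect the cap."""
    if not calls:
        return []
    size = policy.max_parallel or len(calls)
    return [calls[i:i + size] for i in range(0, len(calls), size)]
```

Note that this only caps how many calls the orchestrator executes per step. Limiting how many calls a model emits in the first place would additionally need prompt or request-level settings, which this sketch does not cover.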
Journey Context:
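Agentic frameworks often treat tool calling as a uniform interface, but models differ in how many tool calls they emit per turn. GPT-4o defaults to aggressive parallel tool calling, returning multiple entries in the tool_calls array of a single response. Claude was observed to behave sequentially (one tool per turn) unless explicitly prompted and configured for parallel use; its API can still return several tool_use blocks in one response unless disable_parallel_tool_use is set, so an orchestrator should not rely on receiving exactly one call. Gemini supports parallel calls, but in this report's experience its reasoning accuracy drops noticeably with more than 2 parallel calls. If an orchestrator assumes sequential execution, GPT-4o's parallel calls get dropped or executed out of order. If it assumes parallel execution, it can stall or loop on Claude while waiting for a batch of calls that never arrives. The orchestrator must adapt to each model's native execution bias.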
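The dropped-call and stalled-loop failure modes described above both go away if the orchestrator always treats a response as a list of zero or more calls. The sketch below assumes plain-dict messages in two shapes: an OpenAI-style message with a tool_calls list whose arguments are JSON strings, and an Anthropic-style message whose content is a list of blocks with type "tool_use". extract_tool_calls and run_calls are hypothetical helper names, and the tools registry is an assumption for illustration.

```python
import json


def extract_tool_calls(message):
    """Return a list of {"id", "name", "args"} dicts from an assistant message.

    Always returns a list, so zero, one, or many calls take the same path.
    """
    calls = []
    # OpenAI-style shape: a tool_calls list with JSON-encoded arguments.
    for tc in message.get("tool_calls") or []:
        fn = tc["function"]
        calls.append({
            "id": tc["id"],
            "name": fn["name"],
            "args": json.loads(fn["arguments"]),
        })
    # Anthropic-style shape: content is a list of typed blocks.
    content = message.get("content")
    if isinstance(content, list):
        for block in content:
            if block.get("type") == "tool_use":
                calls.append({
                    "id": block["id"],
                    "name": block["name"],
                    "args": block["input"],
                })
    return calls


def run_calls(calls, tools):
    """Run every call in order and pair each result with its call id."""
    return [
        {"id": c["id"], "result": tools[c["name"]](**c["args"])}
        for c in calls
    ]
```

Keeping the call id next to each result lets the orchestrator return results to the model in the original order, whichever model produced the calls.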
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:57:12.743164+00:00— report_created — created