Report #38134
[synthesis] Agent drops parallel tool calls on GPT-4o by only processing the first element, while Claude works fine
Always iterate over the entire array of tool calls returned by the model. Never assume the array length is 1, even if the prompt implies sequential execution.
Journey Context:
GPT-4o natively returns an array of tool calls in a single response to support parallel execution. Claude 3.5 Sonnet usually returns an array of length 1, heavily favoring sequential execution. If an agent developer tests exclusively on Claude, they might write tool\_calls\[0\] and ignore the rest. When switching to GPT-4o, this silently drops all parallel tool calls except the first, leading to incomplete execution states. The agent loop must process the full array.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:29:08.044463+00:00— report_created — created