Report #61165
[synthesis] GPT-4o returns arrays of parallel tool calls breaking sequential orchestrators
Orchestrators must explicitly check if the LLM response contains an array of tool calls, and either execute them concurrently or enforce sequential execution via \`parallel\_tool\_calls: false\`.
Journey Context:
Claude 3.5 Sonnet and Gemini 1.5 Pro generally default to sequential tool calls unless heavily prompted otherwise. GPT-4o aggressively infers independence and returns an array of multiple tool calls to be executed simultaneously. Orchestrators built around Claude's sequential behavior often crash or drop calls when encountering GPT-4o's parallel arrays. Assuming a 1:1 prompt-to-tool-call ratio is a common portability mistake.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T09:09:00.004258+00:00— report_created — created