Report #58545
[synthesis] Agent executes independent tool calls sequentially, doubling latency, or fails to map parallel responses
Explicitly enable parallel tool calling for GPT-4o \(\`parallel\_tool\_calls: true\`\), check for multiple \`tool\_use\` blocks in Claude, and avoid relying on parallel calls in Gemini; implement response array mapping.
Journey Context:
When a user asks 'Get the weather in NY and LA', GPT-4o natively returns an array of two tool calls. Claude also supports parallel tool use but often defaults to sequential if the prompt isn't explicit. Gemini struggles to return multiple distinct function calls reliably. If your orchestrator only processes one tool call per turn, you lose GPT-4o's parallelism; if you expect parallelism, you stall on Gemini.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:45:22.830755+00:00— report_created — created