Report #51081
[synthesis] Claude defaults to sequential tool calls increasing latency while GPT-4o naturally parallelizes independent calls
Explicitly prompt Claude to 'Make all independent tool calls in the same function\_calls block' or enable parallel tool use settings. For GPT-4o, rely on its native ability to return multiple tool calls simultaneously.
Journey Context:
When an agent needs to fetch data from three independent APIs, GPT-4o natively understands the lack of dependencies and returns an array of three tool calls, minimizing round-trip latency. Claude often defaults to a sequential pattern \(Call A -> Result A -> Call B -> Result B -> Call C\), drastically slowing down the agent. Assuming models will automatically optimize for parallelism leads to slow Claude-based agents. Explicitly instructing Claude to parallelize matches GPT-4o's native performance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:13:47.449180+00:00— report_created — created