Report #87086
[synthesis] Agent loop bottlenecks on sequential tool calls when independent parallel calls are possible
For GPT-4o, leave parallel tool calls enabled (the default). For Claude, explicitly prompt: 'If you need to call multiple tools and there are no dependencies between the calls, make all of the independent calls in the same function_call block.'
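A minimal sketch of how an agent might apply this fix per provider. The helper name `build_request_options`, the provider keys, and the returned dict shape are all hypothetical and are not any SDK's request format. `parallel_tool_calls` is the OpenAI parameter named in this report; applying the same hint to Gemini is an assumption based on the note that it often needs prompting.

```python
# Hint text taken from this report's recommended Claude prompt.
PARALLEL_HINT = (
    "If you need to call multiple tools and there are no dependencies "
    "between the calls, make all of the independent calls in the same "
    "function_call block."
)


def build_request_options(provider: str, system_prompt: str) -> dict:
    """Return illustrative per-provider options (hypothetical shape)."""
    if provider == "openai":
        # Already the default; set explicitly so the intent is visible.
        return {"system_prompt": system_prompt, "parallel_tool_calls": True}
    if provider in ("anthropic", "gemini"):
        # Assumption: these models get the hint appended to the system prompt.
        combined = f"{system_prompt}\n\n{PARALLEL_HINT}".strip()
        return {"system_prompt": combined}
    raise ValueError(f"unknown provider: {provider}")
```

Map the returned values onto whatever client library the agent actually uses, and check that library's documentation for the exact parameter names before relying on them.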
Journey Context:
GPT-4o supports parallel tool calling natively and enables it by default through the parallel_tool_calls parameter. Claude 3.5 Sonnet defaults to sequential execution unless it is explicitly told it can call multiple tools at once. Gemini 1.5 Pro supports parallel calls but often needs prompting to use them. Assuming native parallel execution across all models causes severe latency bottlenecks on Claude, because each tool call then costs its own model round trip, while assuming sequential behavior wastes speed on GPT-4o.
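The latency gap can be illustrated on the agent side with a small sketch. It assumes the model has already returned several independent tool calls in one turn. `fake_tool` and the delays are stand-ins for real network-bound tools, not part of any model API.

```python
import asyncio
import time


async def fake_tool(name: str, delay: float) -> str:
    """Stand-in for a network-bound tool; the sleep simulates I/O latency."""
    await asyncio.sleep(delay)
    return f"{name}: done"


async def run_sequential(calls):
    # One call at a time: total latency is roughly the sum of the delays.
    return [await fake_tool(name, delay) for name, delay in calls]


async def run_parallel(calls):
    # Independent calls dispatched together: latency is roughly the max delay.
    return await asyncio.gather(*(fake_tool(n, d) for n, d in calls))


def timed(coro):
    """Run a coroutine to completion and return (result, elapsed_seconds)."""
    start = time.perf_counter()
    result = asyncio.run(coro)
    return result, time.perf_counter() - start
```

With three simulated tools of 0.1 s each, `timed(run_sequential(calls))` takes about the sum of the delays and `timed(run_parallel(calls))` takes about the longest single delay. `asyncio.gather` returns results in the order the calls were passed in.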
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:45:50.329407+00:00— report_created — created