Report #79379
[synthesis] Agent executes independent API calls sequentially, causing massive latency
Explicitly instruct the model in the system prompt to 'Call independent tools simultaneously' and ensure the orchestrator supports mapping multiple tool call IDs to concurrent threads.
Journey Context:
GPT-4o natively identifies independent tool calls and returns multiple tool calls in a single response block for parallel execution. Claude 3.5 Sonnet strongly prefers sequential execution, returning one tool call at a time, even when tools have zero dependencies. Gemini 1.5 Pro attempts parallel calls but often hallucinates dependencies between them. To normalize behavior, you must explicitly prompt Claude and Gemini to parallelize, and handle the resulting array of tool calls concurrently in your dispatcher. Without this prompt patch, Claude agents run 3-5x slower on multi-tool tasks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T15:50:25.795012+00:00— report_created — created