Report #68832
[synthesis] Agentic loop latency explodes due to sequential tool calling in Claude vs GPT-4o
Explicitly instruct Claude with 'You can call multiple tools in the same block' in the system prompt, and disable parallel_tool_calls in GPT-4o if state mutations occur between calls.
Journey Context:
GPT-4o batches independent tool calls into a single assistant turn by default, which cuts the number of model round trips per loop iteration. Claude 3.5 Sonnet defaults to sequential, one-tool-per-turn calls unless explicitly prompted otherwise. The trade-off runs both ways: GPT-4o's parallel calls can cause race conditions if tools mutate shared state, whereas Claude's sequential default is safer but slower. You must actively manage parallelism at the prompt level for Claude and at the API level for GPT-4o.
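A minimal sketch of the two controls described above, assuming the requests are built as plain keyword dictionaries before being handed to each vendor's Python SDK. The helper names (build_openai_request, build_claude_request), the default model strings, and the max_tokens value are illustrative assumptions, not part of the original report. parallel_tool_calls is the Chat Completions parameter named in the workaround. The Claude side relies only on the system prompt text quoted above.

```python
from typing import Any

# Hint text quoted from the workaround; appended to Claude's system prompt.
PARALLEL_HINT = "You can call multiple tools in the same block."


def build_openai_request(
    messages: list[dict[str, Any]],
    tools: list[dict[str, Any]],
    tools_mutate_state: bool,
    model: str = "gpt-4o",  # assumption: adjust to the deployed model
) -> dict[str, Any]:
    """Build Chat Completions kwargs (hypothetical helper)."""
    request: dict[str, Any] = {"model": model, "messages": messages, "tools": tools}
    if tools_mutate_state:
        # Force one tool call per turn so state-mutating tools cannot race.
        request["parallel_tool_calls"] = False
    return request


def build_claude_request(
    messages: list[dict[str, Any]],
    tools: list[dict[str, Any]],
    base_system: str,
    allow_parallel: bool,
    model: str = "claude-3-5-sonnet-latest",  # assumption: adjust as needed
    max_tokens: int = 1024,  # assumption: illustrative value
) -> dict[str, Any]:
    """Build Messages API kwargs (hypothetical helper)."""
    # Only invite batching when the tools are independent of each other.
    system = f"{base_system}\n\n{PARALLEL_HINT}" if allow_parallel else base_system
    return {
        "model": model,
        "max_tokens": max_tokens,
        "system": system,
        "messages": messages,
        "tools": tools,
    }
```

If the official SDKs are in use, the resulting dictionaries would typically be unpacked into client.chat.completions.create(**request) for OpenAI and client.messages.create(**request) for Anthropic. The two APIs expect different tool schema shapes, so the tools list has to be prepared per vendor. As with the workaround itself, verify the behavior against your own agent loop before relying on it.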
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T22:01:16.247934+00:00— report_created — created