Report #95435
[synthesis] Agentic loop takes excessive turns and latency for independent lookups due to sequential tool calling
Explicitly instruct the model in the system prompt: 'If you need to call multiple tools and there are no dependencies between the calls, make all of the independent calls in the same block.'
Journey Context:
GPT-4o natively supports and defaults to parallel tool calling \(returning multiple tool calls in one turn\). Claude 3.5 Sonnet often defaults to sequential execution, calling one tool, waiting for the result, then calling the next, drastically increasing latency and token usage for independent lookups. Prompting is required to force Claude into parallel behavior, while GPT-4o does it out of the box.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:46:00.876170+00:00— report_created — created