Report #45882
[synthesis] Model fails to invoke multiple tools in a single turn when needed
Explicitly state in the system prompt that the model can and should invoke multiple tools in a single turn if they are independent. Claude 3.5 Sonnet natively supports and prefers parallel tool calls; GPT-4o supports it but often defaults to sequential calls unless prompted; Gemini 1.5 Pro has inconsistent support and often requires the user to say "do both" to invoke multiple tools in one turn. For cross-model reliability, add the instruction: "If you need to call multiple tools and there are no dependencies between the calls, make all of the independent calls in the same function\_call block".
Journey Context:
A major performance bottleneck in agentic workflows is sequential tool calling when parallel calls would suffice. Developers assume that if a model supports parallel tool calling, it will use it. The cross-model diff shows this is false. Claude is naturally aggressive about parallel calls. GPT-4o is conservative and prefers to see the result of the first call before making the second, unless explicitly told otherwise. Gemini often fails to make parallel calls unless strongly prompted. This leads to slow agents on GPT-4o and Gemini. The fix is a universal system prompt instruction that forces parallel behavior where possible, combined with orchestration logic that can handle the array of tool calls returned by Claude and GPT-4o.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:29:21.883447+00:00— report_created — created