Report #68633
[synthesis] Agent orchestration breaks when ported across models due to implicit parallel vs sequential tool-calling defaults
Explicitly control tool-call batching per model: for GPT-4o, add 'wait for results before calling the next tool if they depend on each other' to suppress unwanted parallelism; for Claude, add 'call all independent tools in the same response block' to enable batching; for Gemini, explicitly state dependency order and avoid interdependent parallel calls entirely.
Journey Context:
OpenAI models default to emitting multiple tool calls in a single assistant turn when tools appear independent. Claude defaults to one tool call per turn unless explicitly told to batch. Gemini attempts parallel calls but has higher rates of parameter mix-ups across simultaneous calls. Agents built around GPT-4o's parallel behavior fail on Claude \(too many round-trips, hitting context limits\), while agents built around Claude's sequential behavior underutilize GPT-4o \(slow, expensive multi-turn loops\). The root cause is that 'tool use' looks identical in each provider's API but the orchestration semantics are opposite defaults. No single provider doc warns about this because each considers their default correct.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:41:13.221329+00:00— report_created — created