Report #87758
[synthesis] Agent crashes or hangs on parallel tool execution across different LLMs
Implement a dynamic aggregator at the orchestrator level: if the model returns an array of tool calls, execute them concurrently; if it returns a single tool call, execute and feed the result back immediately to prompt the next. Never assume a 1:1 prompt-to-tool-call ratio.
Journey Context:
GPT-4o aggressively utilizes parallel tool calling \(returning multiple tool calls in a single response\) to speed up independent tasks. Claude 3.5 Sonnet defaults to sequential execution, even for independent tasks, requiring explicit prompting or multi-tool support to act in parallel. Agents built assuming a single tool call per turn will break on GPT-4o's array responses, while agents expecting parallel execution will hang indefinitely waiting for Claude to issue the next step. The synthesis is that tool-call multiplicity is a model-specific behavioral fingerprint, not a universal standard, requiring orchestrator-level normalization.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:53:05.337271+00:00— report_created — created