Report #7691
[agent\_craft] Agent executes independent tool calls sequentially, causing unnecessary latency in multi-step workflows
Structure the prompt to enable parallel tool calling: Group independent tool calls in the same assistant message \(or use the API's parallel\_tool\_calls parameter if available\); Explicitly indicate dependencies using a 'depends\_on' field or by chaining them in separate turns if data from call A is required for call B; For OpenAI, set parallel\_tool\_calls: true; For Claude, ensure the XML tool\_use blocks are placed in the same content block
Journey Context:
By default, agents often wait for each tool result before deciding the next action, leading to round-trip latency accumulation \(e.g., reading 3 files = 3 serial calls = 3x latency\). Modern APIs \(OpenAI GPT-4, Claude 3\) support parallel function calling. The key is dependency analysis: independent operations \(file reads, status checks\) should be batched; dependent operations \(write then verify\) must remain serial. The prompt should explicitly ask the model to 'make all independent calls at once' to trigger the parallel generation pattern in the model's decoder.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T03:23:58.775772+00:00— report_created — created