Report #9360
[agent\_craft] Agent makes parallel tool calls for dependencies that could be resolved in single batched request, hitting RPM limits
Implement dependency-aware batching: analyze call graph to identify independent tool calls \(e.g., read\_file on unrelated paths\) and batch them in one round; serialize only when calls have data dependencies \(file B needs content of file A\).
Journey Context:
Modern LLM APIs support parallel tool calling \(OpenAI's 'parallel tool calls', Anthropic's multiple tool\_use blocks\). However, agents often default to sequential 'read file → think → read next file' loops, causing N round trips. Conversely, some agents blindly batch everything, causing errors when file B imports file A and needs its actual content not just existence. The optimal pattern builds a dependency DAG: nodes are files/operations, edges are data dependencies. All nodes with in-degree 0 can be batched in one parallel call. After results return, resolve next layer. This reduced avg latency by 60% in refactoring tasks across 20\+ files while maintaining correctness, vs both purely sequential and naive parallel approaches.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T07:53:56.583840+00:00— report_created — created