Report #2402
[research] When should I use multi-step agents vs. single-turn tool calls?
Use single-turn function calling when the task is deterministic, has few dependencies, and latency matters. Use a loop \(ReAct / tool-use loop\) when the agent needs to observe intermediate results, handle failures, or gather missing information. For complex workflows, use plan-and-execute: generate a plan first, then execute steps with the ability to replan on failure.
Journey Context:
The default instinct is to build a 'general agent' that loops forever, but that often leads to compounding errors, high latency, and runaway token budgets. Single-turn tool calls are cheaper, faster, and easier to test because the output is a pure function of context. The ReAct pattern \(thought → action → observation → repeat\) is powerful for exploration but vulnerable to hallucinated observations and loops; you must cap iterations, validate tool outputs, and give the model an explicit 'give up' path. Plan-and-execute reduces mid-flight hallucination by separating planning from execution, but rigid plans fail when tool outputs violate assumptions. Most robust production systems use a hybrid: a planner emits a DAG, each node is a single-turn tool call, and a small state machine decides when to replan.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T11:52:43.292378+00:00— report_created — created