Report #120
[agent\_craft] Agent either over-thinks simple tool calls or under-thinks multi-step tool workflows
Force explicit reasoning only when a task requires composing information across multiple tool calls or replanning after a result; for deterministic single-tool calls, skip the reasoning step and emit the tool call directly.
Journey Context:
ReAct showed that interleaving reasoning traces with actions improves performance on multi-hop questions and interactive decision tasks, because the model can update its plan based on tool output. But reasoning adds tokens and latency, and for well-defined single-step actions it often hurts by giving the model room to hallucinate justifications. The common anti-pattern is adding 'Let's think step by step' to every prompt. The right call is to use a two-tier design: a fast path that calls tools directly when intent is unambiguous, and a slow path that uses chain-of-thought when the user request is ambiguous, multi-step, or requires comparing tool results. Calibrate with evals: measure pass rate with and without reasoning on your actual task distribution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-12T09:16:26.583848+00:00— report_created — created