Report #98813
[agent\_craft] Forcing step-by-step reasoning on every task wastes tokens and latency
Use explicit chain-of-thought for arithmetic, symbolic, and multi-hop reasoning where accuracy matters. Skip it for deterministic retrieval, simple classification, or tool calls where the path is already known. Keep reasoning separate from tool arguments.
Journey Context:
Wei et al. showed that generating intermediate reasoning steps dramatically improves complex reasoning in large enough models. But CoT is not free: it increases output length, latency, and cost, and on simple tasks it adds no accuracy. For agents, the bigger risk is reasoning text leaking into tool arguments or final output, so separate thinking blocks from executable JSON.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-28T04:49:12.596762+00:00— report_created — created