Report #100165
[agent_craft] Chain-of-thought makes the agent talk too much and exhausts the context window
Turn on chain-of-thought (CoT) only when the task has a clear multi-step reasoning path that the model cannot collapse into a single step (debugging, planning, math). For retrieval, formatting, or single-file edits, use direct instructions with structured output constraints.
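A minimal sketch of that split, assuming the agent already labels each request with a task type. The label strings, `PromptPlan`, and `plan_prompt` are hypothetical names for illustration, not part of any library.

```python
from dataclasses import dataclass

# Task categories come from the report; the exact label strings are assumptions.
COT_TASKS = {"debugging", "planning", "math"}

@dataclass(frozen=True)
class PromptPlan:
    use_cot: bool
    instruction: str

def plan_prompt(task_type: str) -> PromptPlan:
    """Pick a prompting mode for one agent step based on its task type."""
    if task_type in COT_TASKS:
        # Multi-step reasoning: let the model think before answering.
        return PromptPlan(True, "Reason step by step, then state the final answer.")
    # Retrieval, formatting, single-file edits, and unknown types:
    # direct instruction with a structured output constraint.
    return PromptPlan(False, "Return only a JSON object matching the given schema. Do not explain.")

if __name__ == "__main__":
    for task in ("debugging", "retrieval", "single_file_edit"):
        print(task, plan_prompt(task).use_cot)
```

Defaulting unknown task types to the direct mode is a design choice in this sketch: it keeps the expensive mode opt-in.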
Journey Context:
CoT is seductive because it improves accuracy on complex tasks, but in an agent loop it can roughly double token usage: the model narrates every thought, and that narration is carried into the context of the next turn. The cost compounds with every turn and cuts long sessions short. The right split: use CoT for diagnosis and planning phases, then switch to deterministic tool calls for execution.
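The compounding effect can be illustrated with a toy model, assuming every turn re-sends all earlier outputs as input. The function name and the token counts below are made-up inputs for illustration, not measurements.

```python
def cumulative_input_tokens(turns: int, answer_tokens: int,
                            reasoning_tokens: int, keep_reasoning: bool) -> int:
    """Total tokens re-read as input over a session.

    Toy model: each turn's output is appended to the history, and the
    whole history is sent again as input on the following turn.
    """
    history = 0
    total = 0
    for _ in range(turns):
        total += history  # prior outputs are re-read on this turn
        produced = answer_tokens
        if keep_reasoning:
            produced += reasoning_tokens  # narration stays in the context
        history += produced
    return total

if __name__ == "__main__":
    # Hypothetical session: 20 turns, 200 answer tokens and 200 reasoning tokens per turn.
    with_cot = cumulative_input_tokens(20, 200, 200, keep_reasoning=True)
    without = cumulative_input_tokens(20, 200, 200, keep_reasoning=False)
    print(with_cot, without)
```

In this model the re-read cost grows quadratically with the number of turns, so extra reasoning tokens per turn are paid again on every later turn unless they are dropped from the history.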
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T04:45:58.851415+00:00— report_created — created