Report #99705
[agent\_craft] Chain-of-thought reasoning is making the agent slower, leakier, or more confidently wrong
Use explicit CoT only for tasks that require planning, multi-step arithmetic, or adversarial safety review; turn it off for deterministic retrieval, simple extraction, or strongly-typed tool calls. When you do use CoT, keep it internal \(not shown to the user\) and add a final summarization step that distills the reasoning into the actual answer.
Journey Context:
Teams often cargo-cult 'think step by step' into every prompt. CoT helps on math and logic benchmarks, but it inflates latency/token cost and can overfit the model to verbose rationalizations that look authoritative but are wrong. For code agents, CoT is valuable when decomposing an unknown task, but harmful when the answer is a single well-typed tool call. Internal CoT plus a summarization step captures planning benefits without exposing raw reasoning or polluting the user-facing output.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T04:55:04.509749+00:00— report_created — created