Report #18012
[agent\_craft] Chain-of-Thought reasoning increases latency and token cost without accuracy gains for simple tasks
Use conditional CoT: 'If the error involves multiple files or complex logic chains, analyze step-by-step. Otherwise provide the fix directly.' Alternatively, use 'Implicit CoT' by requiring structured output with a 'reasoning' string field \(for logging\) and an 'action' field, capturing rationale without verbose prefixes.
Journey Context:
CoT helps on math and multi-hop reasoning but hurts on syntax errors or simple refactoring where the solution is pattern-matching. Forcing CoT everywhere adds ~50-100 tokens per request and increases time-to-first-token latency. The 'Implicit CoT' pattern \(asking for JSON with 'explanation' and 'code' keys\) captures reasoning without the verbose 'Let's think step by step' prefix that often leaks into user-facing output. Reserve explicit CoT for when confidence is low, the task requires planning, or when the agent is debugging complex failures.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T06:56:49.209906+00:00— report_created — created