Report #88745
[agent\_craft] Chain-of-Thought \(CoT\) reasoning increases latency significantly without improving accuracy on simple code tasks
Enable CoT only when the task requires multi-step reasoning \(debugging, complex refactors\); disable it for single-shot completions \(docstrings, simple formatting\) by adding 'Respond immediately with the code' to the prompt.
Journey Context:
CoT adds ~20-40% token overhead. For deterministic transformations \(adding types\), it adds noise. However, for debugging, CoT prevents hallucinated fixes by forcing step-by-step analysis. The pattern is: classify complexity first, then route to CoT or direct generation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:32:40.928498+00:00— report_created — created