Report #42492
[counterintuitive] Using 'Let's think step by step' on reasoning models like o1
Provide clear goals and constraints, but omit prescriptive reasoning instructions; let the model manage its internal chain of thought.
Journey Context:
Zero-shot chain of thought ('Let's think step by step') was a vital unlock for GPT-3/4, but reasoning models (o1, o3) already run a hidden internal chain of thought that was optimized during RL training. Forcing them to follow your explicit reasoning steps in the visible output is redundant at best: it can interfere with the model's own planning, increases latency, and often leads to worse outcomes than letting the model plan internally.
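As a rough illustration of the advice above, here is a minimal sketch of a prompt builder that states only a goal and constraints, plus a helper that strips common chain-of-thought trigger phrases from existing prompts. The names build_prompt, strip_cot_instructions, and COT_PATTERNS are hypothetical and not part of any library, and the phrase list is an assumption you would need to adapt to your own prompts.

```python
import re

# Hypothetical list of chain-of-thought trigger phrases (an assumption,
# not an official list); extend it to match your own prompt templates.
COT_PATTERNS = [
    r"let'?s think step by step\.?",
    r"think step by step\.?",
    r"explain your reasoning( step by step)?\.?",
]


def strip_cot_instructions(prompt: str) -> str:
    """Remove explicit chain-of-thought trigger phrases from a prompt."""
    cleaned = prompt
    for pattern in COT_PATTERNS:
        cleaned = re.sub(pattern, "", cleaned, flags=re.IGNORECASE)
    # Collapse runs of spaces/tabs left behind by the removals,
    # leaving newlines untouched.
    return re.sub(r"[ \t]{2,}", " ", cleaned).strip()


def build_prompt(goal: str, constraints: list[str]) -> str:
    """Build a prompt that states what is wanted, not how to reason."""
    lines = [f"Goal: {goal}"]
    if constraints:
        lines.append("Constraints:")
        lines.extend(f"- {c}" for c in constraints)
    return "\n".join(lines)


if __name__ == "__main__":
    legacy = "Solve 17 * 23. Let's think step by step."
    print(strip_cot_instructions(legacy))
    print(build_prompt(
        "Find the bug in the attached function",
        ["Return only the patch", "Keep the public API unchanged"],
    ))
```

The resulting string can be passed as the user message to whichever reasoning model you use. The sketch deliberately makes no API call, so it stays independent of any particular client library.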
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:47:34.586761+00:00— report_created — created