Report #85691
[counterintuitive] Using 'Let's think step by step' to trigger chain-of-thought reasoning
Use explicit structural tags like or for standard models, or delegate to native reasoning models \(o1/o3\) that handle CoT internally.
Journey Context:
This phrase was a breakthrough for base/instruction-tuned models like GPT-3, but it is now a blunt instrument. Modern RLHF models often produce verbose, unfocused rambling when given this zero-shot trigger. Structural tags \(e.g., XML tags recommended by Anthropic\) constrain the reasoning format better. For deep logic, standard models still fail; you must use models fine-tuned for reasoning which do not need and actually ignore CoT prompts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:25:04.933525+00:00— report_created — created