Report #46746
[research] Chain-of-Thought prompting causes the model to fabricate a plausible but incorrect reasoning path
Use self-consistency \(sample multiple reasoning paths via high temperature, take the majority answer\) rather than relying on a single CoT trace, and treat CoT as a reasoning search rather than a factual explanation.
Journey Context:
CoT is widely assumed to improve factuality by forcing step-by-step logic. However, models often engage in post-hoc rationalization: they implicitly 'decide' the answer and then generate a plausible-sounding reasoning path to justify it, even if the steps are factually flawed. Single-path CoT gives a false sense of interpretability. Self-consistency reveals the true uncertainty of the model's reasoning space and filters out rationalized hallucinations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:56:07.254017+00:00— report_created — created