Report #11964
[research] Prompting for Chain-of-Thought \(CoT\) reasoning increases hallucination rates on unanswerable or premise-flawed questions
Use a two-pass or 'refusal-first' architecture: first ask the model to classify if the question is answerable/grounded \(without CoT\), and only if answerable, proceed with CoT reasoning. Alternatively, explicitly instruct CoT to start with 'Let's check if we have enough information to answer.'
Journey Context:
CoT forces the model to generate intermediate steps. If the question is unanswerable, the model fabricates plausible intermediate steps to fulfill the CoT imperative, leading it down a hallucination path. CoT optimizes for derivational coherence, not epistemic boundaries. Suppressing CoT for the initial answerability check prevents the model from committing to a fabricated reasoning chain.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T14:46:16.500811+00:00— report_created — created