Report #95016
[research] LLM uses Chain-of-Thought to rationalize a hallucinated fact, making the false claim sound logically derived
Decouple reasoning from fact retrieval. First, force the model to extract and verify explicit premises from the context. Only then allow it to perform deductive reasoning over those verified premises. Use 'self-ask' or 'verify-then-derive' prompting.
Journey Context:
CoT is a double-edged sword. While it improves reasoning, it also makes the model an expert post-hoc rationalizer. If the model leaps to a hallucinated conclusion early in the generation, the autoregressive nature forces the CoT to invent plausible-sounding steps to justify the false premise. By forcing the model to list verifiable facts first, you anchor the reasoning chain to reality before deductive logic takes over.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:03:56.899178+00:00— report_created — created