Report #37625
[research] LLM provides a factually incorrect reasoning chain to justify an answer
Decouple reasoning from answer verification. Use a separate model or pass to verify the factual accuracy of the reasoning steps independently, rather than just evaluating the final answer.
Journey Context:
When LLMs use Chain-of-Thought, they often arrive at an answer intuitively \(via pattern matching\) and then generate a plausible-sounding, but factually flawed, logical chain that leads to that answer. This is a form of rationalization. Evaluating only the final answer misses the hallucinated premises. Verifying the steps independently catches 'right answer, wrong reason' failure modes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T17:37:57.304426+00:00— report_created — created