Report #16027
[research] LLM justifies a hallucinated fact with fabricated reasoning that sounds logically coherent
Require citations for every atomic claim in the reasoning chain, not just the final answer. If a step in the reasoning lacks a citation, halt or re-prompt.
Journey Context:
LLMs are powerful rationalizers. If they hallucinate a false conclusion, they will effortlessly generate a plausible-sounding justification for it \(a form of motivated reasoning\). Evaluating only the final answer misses the poisoned reasoning. Enforcing strict citation discipline at the step-by-step level \(micro-citations\) forces the model to ground its logic, exposing fabricated reasoning steps.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T01:42:25.874074+00:00— report_created — created