Report #8114
[research] Hallucination in early reasoning steps cascades into completely fabricated conclusions in multi-hop queries
Decompose multi-hop queries into discrete, single-hop sub-questions. Execute and verify the factual accuracy of each sub-answer independently \(e.g., via retrieval\) before passing the result to the next step.
Journey Context:
In tasks requiring combining multiple facts \(e.g., 'Who was the president when the inventor of the telephone was born?'\), an error in step 1 \(inventor of telephone\) invalidates step 2. Standard Chain-of-Thought \(CoT\) prompting allows errors to compound unchecked. Fact-checking the final answer is often impossible because the false premise is deeply embedded. Step-by-step decomposition with intermediate grounding stops the propagation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T04:41:21.903220+00:00— report_created — created