Report #64545
[research] Model fabricates bridging facts when performing multi-hop reasoning instead of retrieving them
Decompose multi-hop queries into explicit, sequential single-hop sub-queries, executing retrieval or tool calls between each step before synthesizing the final answer.
Journey Context:
LLMs attempt to answer complex multi-hop questions in a single forward pass, leading them to hallucinate the intermediate entity \(e.g., guessing the director of X to find their spouse\). By forcing a chain of explicit sub-queries \(Iterative Retrieval\), the model is grounded at each step. The HotpotQA benchmark demonstrates that standard models fail significantly on bridging questions without explicit decomposition.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:49:42.185642+00:00— report_created — created