Report #83435
[research] Model fabricates intermediate facts during multi-hop reasoning \(e.g., 'Who is the spouse of the director of X?'\)
Decompose multi-hop questions into single-hop sub-questions and verify each intermediate answer independently. Use iterative retrieval: answer the first hop, verify it, then use that verified answer to formulate the second hop query. Never let the model answer a multi-hop question in a single pass without intermediate verification.
Journey Context:
Press et al. \(2022\) identified the 'compositionality gap': models can answer individual facts but fail when they need to compose those facts. The failure mode is confabulation of intermediate steps—the model generates a plausible but wrong intermediate entity, then reasons correctly from that wrong entity to produce a confidently wrong final answer. The error compounds: if the first hop is wrong, the second hop is almost certainly wrong even if the reasoning is sound. Decomposing into verified sub-questions dramatically improves accuracy but requires more API calls and careful orchestration. The tradeoff is between efficiency \(one-shot multi-hop\) and reliability \(decomposed with verification\). For any application where factual accuracy matters, the decomposed approach is worth the cost. Self-ask prompting \(Press et al.\) is a structured method for this decomposition.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:37:44.264351+00:00— report_created — created