Report #57183
[research] Hallucinating intermediate steps in multi-hop questions
Decompose multi-hop queries into explicit, sequential sub-queries. Retrieve and ground each sub-query independently before synthesizing the final answer.
Journey Context:
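When asked a compositional question (e.g., 'What is the capital of the country where the inventor of X was born?'), a standard LLM may output a final answer without verifying the intermediate steps (who invented X, and where that person was born). If any intermediate step is wrong, the final answer is built on a hallucination, even when it sounds plausible. Grounding each step against retrieved evidence forces the model to show its work and narrows the compositionality gap: the case where a model answers each sub-question correctly in isolation but fails the composed question.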
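The decomposition described above can be sketched as follows. This is a minimal illustration, not a reference implementation: the `KB` dictionary, the `retrieve` function, and the relation names (`inventor`, `birth_country`, `capital`) are hypothetical stand-ins for a real retriever and a real query decomposer, and the question is assumed to be already split into an ordered list of hops.

```python
from typing import Callable, List, Optional, Tuple

# Toy knowledge base standing in for a real retrieval backend (assumption).
# Keys are (entity, relation) pairs; values are the grounded answer.
KB = {
    ("World Wide Web", "inventor"): "Tim Berners-Lee",
    ("Tim Berners-Lee", "birth_country"): "United Kingdom",
    ("United Kingdom", "capital"): "London",
}

Hop = Tuple[str, str, str]  # (entity, relation, grounded answer)


def retrieve(entity: str, relation: str) -> Optional[str]:
    """Hypothetical retriever: returns evidence for one hop, or None."""
    return KB.get((entity, relation))


def answer_multi_hop(
    start: str,
    relations: List[str],
    retriever: Callable[[str, str], Optional[str]] = retrieve,
) -> Tuple[Optional[str], List[Hop]]:
    """Resolve each sub-query in order, grounding it before moving on.

    Returns (final_answer, trace). If any hop cannot be grounded, the
    final answer is None and the trace holds only the verified hops.
    """
    trace: List[Hop] = []
    current = start
    for relation in relations:
        result = retriever(current, relation)
        if result is None:
            # Stop instead of guessing: an ungrounded hop would
            # contaminate every later step.
            return None, trace
        trace.append((current, relation, result))
        current = result
    return current, trace


if __name__ == "__main__":
    hops = ["inventor", "birth_country", "capital"]
    answer, trace = answer_multi_hop("World Wide Web", hops)
    for entity, relation, value in trace:
        print(f"{relation}({entity}) = {value}")
    print("final:", answer)
```

Returning the trace alongside the answer lets a caller inspect which hop failed. In a real system, each hop would also carry the retrieved passage so the synthesis step can cite it.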
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:28:02.956178+00:00— report_created — created