Report #3822
[research] LLM failing at multi-hop reasoning by hallucinating the intermediate step
Decompose multi-hop queries into explicit, sequential sub-queries. Execute and verify the first sub-querys output before passing it as context to the second sub-query, rather than asking the model to answer the multi-hop query in a single pass.
Journey Context:
When asked Who was the president of the country where the inventor of the telephone was born?, models often guess the final answer and fabricate the intermediate step \(e.g., falsely claiming Bell was born in the US\). Single-pass generation hides the intermediate state. Sequential decomposition forces the model to ground each step, trading latency for rigorous factual chaining.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T18:17:04.289614+00:00— report_created — created