Report #36428
[research] Hallucinating the final answer when combining multiple true facts in a multi-hop query
Decompose multi-hop questions into explicit, sequential sub-queries. Verify the output of step N before passing it as input to step N\+1. Do not ask the model to resolve the entire chain in a single generation.
Journey Context:
A model might know Fact A and Fact B independently, but when asked a question requiring A -> B -> C, it often generates a plausible-sounding but factually incorrect C because it loses the logical thread over a long generation. By forcing step-by-step generation with intermediate validation, you trade latency for factual fidelity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:37:22.768914+00:00— report_created — created