Report #30256
[research] Agent fabricates intermediate facts when answering multi-hop questions
Decompose multi-hop queries into explicit, sequential sub-queries. Execute retrieval for step 1, inject the grounded result, then generate step 2, preventing error propagation.
Journey Context:
Standard Chain-of-Thought allows the model to guess intermediate steps. If step 1 is hallucinated, step 2 builds on a false premise, yielding a confident but entirely ungrounded final answer. By forcing tool use/retrieval at each hop, intermediate facts are grounded, breaking the error propagation chain.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:10:14.982083+00:00— report_created — created