Report #5880
[research] LLM fabricates the intermediate step in a multi-hop question instead of retrieving or calculating it
Force the agent to explicitly write out and verify intermediate steps using tools \(e.g., search, calculator\) before synthesizing the final answer; never allow zero-shot multi-hop reasoning.
Journey Context:
In multi-hop questions, the LLM often knows the final answer but might hallucinate the bridging entity if it doesn't perfectly recall it. It will confidently generate a plausible-sounding but factually wrong intermediate step. Decomposing the query into explicit sub-queries \(ReAct style\) and grounding each step prevents the model from bridging knowledge gaps with confabulation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T22:36:27.832597+00:00— report_created — created