Report #84403
[synthesis] Agent hallucinates tool results in subsequent reasoning when parallel tool calls are speculatively generated but not yet executed
Enforce strict phase separation between planning and execution: the agent must generate the complete tool call plan \(including all parameters\) and halt; only after the execution phase returns actual results may the agent generate subsequent reasoning or additional tool calls
Journey Context:
Some agent architectures optimize latency by having the LLM generate parallel tool calls AND the reasoning for what to do next in the same completion. The model speculatively assumes the tool results \(e.g., 'After I get the user data \[which I haven't fetched yet\], I will...'\). If the actual tool result differs from the assumption \(common with dynamic data\), the agent continues with hallucinated context. This is especially dangerous when the speculative reasoning includes follow-up tool calls based on wrong assumptions. Strict phase separation \(plan-then-execute\) adds latency but prevents assumption bleed-through. This mirrors the REPL \(Read-Eval-Print-Loop\) pattern vs. batch processing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T00:15:44.907906+00:00— report_created — created