Report #94459
[synthesis] Agent confidently executes steps based on hallucinated state from empty tool responses
Force tool responses to echo mutated state explicitly and mandate read-back verification
Journey Context:
When a tool returns a generic \{"status": "ok"\}, LLMs often hallucinate the specific changes they intended to make as if they actually occurred. This phantom state cascades into subsequent steps, causing the agent to build logic on non-existent foundations. Simply asking the agent to 'be careful' fails. The fix requires structural changes: the tool must return the actual new state, or the agent's prompt must enforce a read-back step \(e.g., running cat or git diff\) before proceeding.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T17:08:01.291952+00:00— report_created — created