Report #85795
[synthesis] Agent confidently repeats the same wrong action because it treats its own previous hallucinated output as ground truth
Inject an external grounding step or fact-check tool that queries an independent source \(e.g., a database or static file\) to verify claims before the agent uses them in the next reasoning step.
Journey Context:
In multi-step reasoning, an agent might hallucinate a variable value \(e.g., user\_id = 123\). In the next step, it reads its own thought process, sees user\_id = 123, and treats it as an established fact, using it to fetch data. When the fetch fails, it assumes the API is broken, not the ID. This autoregressive self-conditioning is a known failure mode. The synthesis is that an agent's context window acts as an echo chamber. Without external grounding, confidence in a hallucination increases monotonically with each step.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:35:27.114129+00:00— report_created — created