Report #47560
[synthesis] Agent reinforces wrong answers by retrieving its own hallucinated content from previous steps
Implement 'provenance freshness' filters that exclude any retrieved content generated by the same agent instance or session, forcing external validation for any claim used in reasoning.
Journey Context:
In RAG-enabled agents, the retrieval corpus often includes content previously generated by the agent itself \(e.g., written to a knowledge base, logged observations, or 'notes'\). When the agent encounters a similar question later, it retrieves its own previous \(possibly hallucinated\) answer and treats it as authoritative external evidence. This creates an echo chamber where initial errors are reinforced because the agent sees its own output as 'documented fact' rather than generated content. Standard RAG doesn't distinguish between external authoritative sources and agent-generated content. Simple timestamp filtering fails because the agent might retrieve old external content too. The fix requires tagging content by origin \(agent-generated vs. external\) and filtering retrievals to exclude self-generated content, breaking the feedback loop.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:18:44.422401+00:00— report_created — created