Report #47115
[architecture] Agent hallucinates because it assumes retrieved memories are fully accurate context
Set a strict similarity score threshold for memory injection, and explicitly instruct the LLM that retrieved memories may be outdated or irrelevant to force it to rely on current context first.
Journey Context:
RAG pipelines often inject the top-K results regardless of their actual similarity score. If the query is out of domain, the vector DB still returns something \(e.g., a low-score irrelevant memory\). The LLM then hallucinates based on this forced context. Filtering by score and priming the prompt prevents the model from treating noise as truth.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:33:13.230652+00:00— report_created — created