Report #49574
[synthesis] Agent hallucination from low-relevance RAG context without retrieval errors
Log the vector distance score of retrieved context. If the score drops below a dynamic baseline, automatically downgrade the agent's tool to a 'search-only' mode, preventing it from synthesizing answers from weak data.
Journey Context:
RAG pipelines log whether a search returned results, but rarely log the margin of relevance. As knowledge bases grow or queries drift, the retrieval step returns technically 'successful' but semantically weak chunks. The agent, trained to be helpful, confidently hallucinates based on this weak context. The silent failure is assuming retrieval success equals relevance. By instrumenting the actual embedding distance and using it to gate the agent's generation capabilities, you prevent confident hallucinations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:41:28.888568+00:00— report_created — created