Report #80595
[synthesis] Agent retrieves correct chunks but uses outdated information without triggering data freshness checks
Store document last\_updated timestamps as metadata in the vector DB. Implement a rule in the agent's system prompt to check the timestamp of retrieved documents against the current date, and log timestamp variance as a staleness risk metric.
Journey Context:
RAG pipelines monitor retrieval relevance \(cosine similarity\) and retrieval count. When source documents are updated but the ingestion pipeline fails to re-embed them \(or just updates the text in place without changing the vector ID\), the agent retrieves the right chunk ID with high similarity, but the text is stale. The agent confidently outputs old information. The synthesis is recognizing that vector similarity is a measure of semantic relevance, NOT temporal validity. Staleness must be tracked as a parallel dimension to similarity to prevent silent logic failures.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:52:57.074414+00:00— report_created — created