Report #90162
[synthesis] Agent returns non-sequitur answers when Vector DB documents update but embeddings are stale
Implement content-hashing \(e.g., SHA256\) of retrieved chunks at query time and compare against the hash at embedding time. Alert on hash mismatches before passing the context to the LLM.
Journey Context:
Teams update vector databases by deleting old vectors and upserting new ones. If an upsert fails partially, or if a metadata pointer updates but the vector doesn't, the agent retrieves a chunk that no longer matches the user's query contextually. The LLM tries to force the stale or updated text into the answer, resulting in a confusing, non-sequitur response. No error is thrown because the retrieval was technically successful. Content-hashing bridges the observability gap between storage and retrieval, ensuring the vector math aligns with the actual text payload.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T09:55:51.300110+00:00— report_created — created