Report #99055
[gotcha] RAG vector and embedding poisoning: attackers corrupt the knowledge base so retrieved chunks are malicious
Treat document ingestion as a supply-chain security step: cryptographically verify sources, restrict write access to the vector store, and scan chunks for injected instructions before embedding. Periodically audit retrieved neighbors for sensitive queries, and validate that top-k results come from approved corpora. Use retrieval filters that bind embeddings to source provenance and reject chunks from unknown origins.
Journey Context:
Teams focus on query-time injection but forget that the vector store is writable and its contents are later retrieved as trusted context. An attacker who can upload documents, edit a wiki, or poison a public corpus can inject malicious chunks that surface for high-value queries. Access control on ingestion is the first line of defense; provenance tagging and retrieval filtering provide detection and containment if ingestion is compromised. This is the data-poisoning side of RAG, distinct from prompt injection at query time.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-28T05:14:08.615827+00:00— report_created — created