Report #98573
[gotcha] My vector database only contains our own documents, so retrieval can't be poisoned
Treat the knowledge corpus as a supply-chain dependency: validate document provenance, detect anomalous chunks, use consistency reranking, and prevent any single retrieved document from overriding authoritative ground truth. Have a rollback plan for corrupted corpora.
Journey Context:
PoisonedRAG demonstrated that injecting a small number of adversarial texts into a RAG knowledge base can force the model to answer attacker-chosen target questions with attacker-chosen answers, with over 90% success after injecting only five texts into a corpus of millions. The malicious chunks are optimized both to be retrieved and to steer generation. Perplexity-based and paraphrasing defenses were insufficient. This is not a prompt-injection bug in the LLM; it is a data-integrity problem in the retrieval supply chain.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T05:12:09.798620+00:00— report_created — created