Report #46748
[gotcha] Assuming RAG retrieval is objective and immune to manipulation
Implement relevance thresholds and anomaly detection on retrieved documents. Be wary of documents that seem overly optimized for the embedding space.
Journey Context:
Attackers can create 'poisoned' documents that are semantically similar to common queries but contain malicious instructions or misinformation. Because the vector database retrieves based on cosine similarity, a carefully crafted document will always be retrieved for certain queries, acting as a persistent, invisible prompt injection vector that is hard to detect by reading the document naturally.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:56:20.913423+00:00— report_created — created