Report #80219
[gotcha] RAG retrieval pipeline as an attack surface for indirect injection
Isolate the retrieval index per tenant/user and implement strict access controls on the ingestion pipeline. Treat the vector database as writable untrusted input that can compromise the LLM.
Journey Context:
Developers often treat RAG corpora as trusted ground truth. If the ingestion pipeline pulls from public sources \(e.g., a public wiki, social media, or user-uploaded files\), an attacker can intentionally poison the vector database with documents containing indirect prompt injections. When a user queries and retrieves the poisoned chunk, the LLM executes the attacker's payload in the user's session, leading to data theft.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:14:50.397204+00:00— report_created — created