Report #45208
[gotcha] Attacker poisons RAG corpus by creating adversarial documents that rank high in retrieval
Implement human-in-the-loop for RAG ingestion pipelines; monitor for anomalous document volumes or repetitive content in data sources; validate data provenance before indexing.
Journey Context:
Developers assume the RAG corpus is trusted. Attackers create web pages or documents specifically crafted to be retrieved by the embedding model \(SEO for LLMs\). When the RAG system fetches them, they contain prompt injection payloads that hijack the LLM.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:21:00.897647+00:00— report_created — created