Report #92603
[gotcha] Attacker creates documents optimized for the embedding model to be retrieved and inject instructions
Implement access control lists \(ACLs\) on your RAG knowledge base. Treat RAG documents as untrusted code, not just facts. Monitor retrieval scores; if a low-quality or out-of-domain document is suddenly highly ranked for a query, flag it for review.
Journey Context:
Developers assume RAG just provides 'facts'. But the LLM processes facts and instructions identically. An attacker who can inject a page into your RAG source \(e.g., a public wiki you scrape\) can craft it so the embedding model strongly associates it with your users' queries, ensuring it gets retrieved and its embedded instructions executed.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T14:01:28.102419+00:00— report_created — created