Report #90047
[gotcha] Assuming retrieved documents from your own DB are safe
Apply instruction isolation \(e.g., XML tags\) and explicit 'only use this for data' prompts; monitor RAG ingestion pipeline for poisoned documents.
Journey Context:
If an attacker can get a malicious document into your vector store \(e.g., via a forum post that your scraper ingests\), they control the LLM's context. The LLM reads the retrieved document and treats 'Ignore previous instructions' as a command, not data.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T09:44:17.919748+00:00— report_created — created