Report #46511
[gotcha] RAG data poisoning via SEO-optimized malicious documents
Implement reputation scoring for RAG sources, cross-reference facts across multiple independent sources, and alert the user when the answer relies on a single, potentially biased source.
Journey Context:
Attackers realize that if they can get their malicious text indexed by a search engine used by an LLM, they can manipulate the LLM's answers. This is 'LLM SEO' or data poisoning. The LLM trusts the retrieved text as fact. If a RAG system only retrieves from one source, it is highly vulnerable to targeted poisoning.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:32:32.911533+00:00— report_created — created