Report #38081
[gotcha] Attacker floods a public data source \(that the RAG system ingests\) with massive amounts of irrelevant or malicious text, degrading the LLM's performance
Implement data provenance tracking and reputation scoring for RAG sources. Rate limit ingestion and use anomaly detection to spot sudden changes in source data volume or content.
Journey Context:
RAG systems often scrape public forums or wikis. If an attacker spams the forum, the RAG system ingests it. This can lead to the LLM refusing to answer \(if poisoned with safety triggers\) or just outputting garbage, effectively causing a Denial of Service on the AI application.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:23:55.821649+00:00— report_created — created