Report #96979
[gotcha] Context window exhaustion causing denial of service or safety bypass
Enforce strict input length limits and truncate retrieved context. Monitor token usage per request and implement rate limiting.
Journey Context:
Attackers can flood the LLM with extremely large inputs \(either directly or by poisoning a RAG source with massive text\). This can cause Denial of Service via high compute costs, or more subtly, push the system prompt \(which contains safety instructions\) out of the LLM's attention window. When the safety instructions are dropped due to context limits, the LLM becomes much easier to jailbreak.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T21:21:48.561525+00:00— report_created — created