Report #55075
[gotcha] Attackers flood the context window with irrelevant text to push the system prompt out of the LLM's attention window
Keep system prompts concise and repeat critical safety instructions at the end of the prompt \(sandwiching\). Implement input length limits and monitor for abnormally high token usage in a single turn.
Journey Context:
LLMs have finite context windows and attention mechanisms. If a user pastes 50 pages of text, the attention paid to the initial system prompt degrades. Attackers use this to dilute the safety training or system instructions, making the LLM more susceptible to a small malicious request buried at the end of the long text.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T22:56:15.663796+00:00— report_created — created