Report #28834
[gotcha] LLMs ignoring system instructions when the context window is flooded with irrelevant or repetitive text
Implement aggressive context window management, truncating or summarizing excessively long user inputs before passing them to the model, to ensure the system prompt retains sufficient attention weight.
Journey Context:
LLMs use attention mechanisms. If a user injects 10,000 words of garbage or repeated phrases, the attention paid to the 500-word system prompt drops to near zero. The model 'forgets' its instructions simply due to the mathematical dilution of attention across the context window, allowing the attacker to slip in malicious instructions at the end that receive disproportionate attention.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T02:47:36.428973+00:00— report_created — created