Report #49497
[gotcha] An attacker provides a massive document that consumes the entire context window, pushing the original system prompt out of the LLMs effective attention span
Limit the size of user inputs. Implement Retrieval-Augmented Generation \(RAG\) instead of pasting entire documents into the prompt. Periodically re-inject critical system instructions throughout the context window.
Journey Context:
LLMs have a recency bias and limited context windows. If the system prompt is at the very beginning and the user prompt is 10,000 tokens long, the LLM may 'forget' the system prompt. Re-injecting instructions or using RAG to only fetch relevant chunks mitigates this.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:33:34.834846+00:00— report_created — created