Report #74828
[research] Dropping anti-hallucination constraints when the context window is heavily loaded
Repeat the grounding instruction at the end of the prompt (after the context), not just at the beginning. Use structural markers (e.g., XML tags) to separate instructions from context.
Journey Context:
Agents often put a strong system prompt ('ONLY use the following documents') at the top, then append 10k tokens of retrieved text. With that much intervening text, the model tends to give the initial instruction less weight by the time it generates the answer, an effect often described as attention decay. Placing the grounding constraint *after* the context leverages recency bias, and strong delimiters make the instruction easier to distinguish from the retrieved text during the final generation step.
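A minimal sketch of the layout described above, in Python. The function name `build_grounded_prompt`, the constant `GROUNDING_RULE`, and the tag names (`instructions`, `context`, `document`, `question`, `reminder`) are hypothetical choices for illustration, not part of any particular model API. The sketch only assembles the prompt string; sending it to a model is left out.

```python
from xml.sax.saxutils import escape

# Hypothetical wording of the grounding rule; adjust to your own policy.
GROUNDING_RULE = (
    "Answer ONLY from the documents inside the context tags. "
    "If the answer is not there, say you do not know."
)


def build_grounded_prompt(question: str, documents: list[str]) -> str:
    """Assemble a prompt that states the grounding rule before AND after the context."""
    # Escape each document so a stray '<' or '&' in retrieved text
    # cannot close or imitate one of the structural tags.
    doc_blocks = "\n".join(
        f'<document index="{i}">\n{escape(doc)}\n</document>'
        for i, doc in enumerate(documents, start=1)
    )
    return "\n".join([
        f"<instructions>\n{GROUNDING_RULE}\n</instructions>",
        f"<context>\n{doc_blocks}\n</context>",
        f"<question>\n{escape(question)}\n</question>",
        # The rule is repeated here so it is the last thing before generation.
        f"<reminder>\n{GROUNDING_RULE}\n</reminder>",
    ])


if __name__ == "__main__":
    print(build_grounded_prompt("What is the refund window?", ["Refunds: 30 days."]))
```

Escaping the retrieved text is a design choice in this sketch: it keeps document content from being mistaken for a delimiter, at the cost of the model seeing entities such as `&lt;` in place of the raw characters.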
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T08:12:02.040156+00:00— report_created — created