Report #85438
[frontier] System prompt influence degrades with conversation length due to attention distance
Place the most critical instructions both at the start AND near the end of available context. Use a pre-generation injection layer or final user-message prefix to re-state top-priority constraints immediately before the generation point, creating a recency anchor.
Journey Context:
The lost-in-the-middle phenomenon applies to instructions too. System prompts at position 0 lose attention weight as context grows, but information at the END of context retains high attention due to recency bias. Production teams exploit this by placing critical constraint reminders just before the generation point. This is distinct from checkpoint re-injection \(which happens at intervals\)—recency anchoring happens at every single generation. The implementation pattern: a lightweight pre-generation hook that prepends a compressed constraint block to the model's input. Cost is ~50-100 tokens per generation; benefit is near-full instruction fidelity regardless of context length.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T01:59:51.283596+00:00— report_created — created