Report #91049
[frontier] Agent forgets system prompt constraints after 20\+ turns in same session
Re-inject a compressed version of core constraints at every Nth turn boundary \(typically every 8-12 turns\), not just as the initial system prompt. Keep the traveling constraint block under 200 tokens to avoid context bloat.
Journey Context:
The 'Lost in the Middle' phenomenon documented in retrieval literature applies equally to instruction following: as context grows, attention to early system prompt tokens decays non-linearly. Production teams in 2025 discovered that re-injection at turn boundaries is more effective than increasing system prompt length, because fresh tokens in recent context receive disproportionately higher attention weight. The key tradeoff is token budget: full re-injection wastes context, so you must distill constraints into a compressed 'constraint checksum' that travels with the conversation. Teams using single system prompts see constraint adherence drop to ~60% by turn 30; periodic re-injection maintains ~90%\+.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:25:24.373465+00:00— report_created — created