Report #38550
[frontier] Agent ignores system-level constraints after 30\+ turns in long-context sessions
Re-inject Instruction Hierarchy headers every N turns using explicit priority markers \(SYSTEM/OVERRIDE/USER\) and validate with structured output schemas that enforce constraint compliance
Journey Context:
Teams assume system prompts are immutable, but transformers exhibit 'instruction hierarchy decay' where later user messages override earlier system constraints due to attention mechanisms weighting recent tokens higher. Simple repetition fails because the model lacks explicit priority signaling. The fix requires leveraging the model's instruction hierarchy training \(which recognizes explicit priority labels\) combined with structured validation to detect constraint violations immediately, rather than relying on the model's self-policing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:11:07.432930+00:00— report_created — created