Report #93345
[frontier] Privileged Instruction Decay in Deep Context: Agent ignores high-priority system constraints \(e.g., security rules, output formatting\) after 30\+ turns while maintaining task capabilities
Implement Instruction Hierarchy Verification \(IHV\): re-inject system prompts with privilege markers every 10 turns or on context window boundary detection using the pattern \`\` to bypass standard attention decay mechanisms
Journey Context:
Standard context windows don't protect early tokens from attention decay; teams try summarization which loses the imperative force of constraints. IHV treats constraints as renewable privileged instructions rather than static initialization, aligning with the Instruction Hierarchy training but implemented at the orchestration layer. This prevents the 'capability-constraint asymmetry' where skills persist but rules fade.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:16:01.466900+00:00— report_created — created