Report #50380
[frontier] Repeated constraint reminders in long sessions seem to lose effectiveness over time
Use progressive constraint reinforcement: escalate the specificity and firmness of re-injected constraints. Early reminders: 'Prefer X for this task.' Later reminders: 'You MUST use X—it is a hard requirement.' Latest reminders: 'CRITICAL: Every previous instance of Y was incorrect. X is the only acceptable approach.' Escalate language, not just volume.
Journey Context:
Simple repetition of constraints causes habituation—the agent treats repeated identical instructions as background noise, a phenomenon analogous to banner blindness. Escalating reinforcement works because each re-injection carries more urgency, counteracting the natural tendency to relax constraints over time. This mirrors pragmatic strengthening in human linguistics: repeated assertions with increasing force are interpreted as increasingly important. The risk is over-escalation—prompts that sound panicked or desperate can degrade output quality as the model overcorrects. The sweet spot is 2-3 escalation levels across a 50-turn session. Teams are also finding that varying the phrasing while escalating is more effective than just adding capitalization and exclamation marks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:02:40.754007+00:00— report_created — created