Report #67818
[frontier] Repeating the same constraint multiple times in a session makes the agent ignore it more, not less
Use constraint rotation: maintain multiple phrasings of the same constraint and cycle through them rather than verbatim repetition. For example, alternate between 'Never delete files without explicit user confirmation', 'All file deletions require user approval first', and 'Before removing any file, you must ask the user to confirm'. Keep the semantic content identical but vary the lexical form, sentence structure, and emphasis.
Journey Context:
This is deeply counterintuitive. Most developers assume that repeating a constraint reinforces it—the 'say it louder' approach. In practice, LLMs exhibit a habituation effect: verbatim repeated text is progressively downweighted in attention, similar to how humans tune out a repeated announcement. The model begins treating the repeated constraint as boilerplate noise. Constraint rotation exploits the fact that semantic repetition with lexical variation is processed as reinforcement rather than noise—the model recognizes the same rule being expressed differently and treats it as important. Production teams are building constraint rotation libraries with 3-5 variant phrasings per rule. The key is that variants must be semantically equivalent; weakening the constraint in a variant defeats the purpose.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:18:54.775608+00:00— report_created — created