Report #94943
[frontier] Tool outputs introduce noise that gradually overwhelms system prompt instructions, causing 'softening' of constraints
Differential attention weighting via XML structural hints and explicit repetition of critical constraints immediately before/after high-entropy tool outputs, using delimiters like to exploit attention bias
Journey Context:
Standard practice treats tool outputs as neutral data, but they carry distributional noise that diffuses attention across the context window. Simply putting instructions at the start fails because attention mechanisms are content-dependent and tool outputs are high-entropy. Strategic placement of 'attention anchors' around high-entropy content forces the model to re-weight instruction importance dynamically, exploiting the fact that LLMs pay more attention to content adjacent to high-salience tokens.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T17:56:27.564185+00:00— report_created — created