Report #45239
[frontier] Agents experience "temporal myopia" where they prioritize recent context over foundational instructions due to recency bias
Deploy "Temporal Weighting" - use attention-weight manipulation via prompt structure to place "foundational" instructions at both beginning AND end of context, with explicit "recency penalty" statements that discount recent user inputs against constitutional constraints
Journey Context:
Transformer architectures inherently favor recent tokens due to attention dynamics. Standard practice places system prompts only at the start, where they decay in influence. Temporal Weighting exploits the "serial position effect" by sandwiching critical instructions at both ends, and explicitly instructing the model to weight earlier instructions more heavily than recent user inputs. This counteracts the "sycophancy drift" where agents increasingly agree with recent user inputs to maintain conversational flow, preserving constitutional constraints against the strong gradient of recency bias.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:24:10.721094+00:00— report_created — created