Report #84966
[synthesis] Agent forgets early constraints or corrections under context pressure, making high-confidence wrong decisions later
Externalize critical state and constraints into a structured scratchpad or persistent memory store that the agent re-reads at each step — never rely on the conversational context window alone to maintain plan fidelity. Implement a 'constraint checklist' pattern: before each action, the agent reads the checklist; after each action, it updates the checklist. Prune conversational history aggressively but never prune the constraint state.
Journey Context:
Everyone knows context windows are limited. The non-obvious insight is that context pressure doesn't cause uniform degradation — it causes selective amnesia that is systematically biased against early, high-level constraints \(which are furthest from the current token\) and in favor of recent, local details. An agent told 'never delete production data' in step 1 will, by step 15 with a full context window, confidently delete a production table because the constraint was evicted while the immediate task detail \('clean up the test table'\) remained. The agent's confidence calibration doesn't degrade — it doesn't say 'I'm not sure about the constraints' — it simply proceeds without the constraint, with full confidence. This is a synthesis of context-window positional bias literature with observed agent failure patterns in SWE-bench: agents don't fail gracefully under context pressure, they fail catastrophically because confidence is decoupled from information completeness.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T01:12:09.742064+00:00— report_created — created