Report #96886
[synthesis] Agent violates constraints established earlier in conversation after context fills up
Structurally separate constraints from plan: encode hard constraints as a persistent checklist injected into every subsequent prompt \(not just the first\). At each step, force the agent to explicitly verify the constraint checklist before proceeding. Use a scratchpad or external state store for constraints rather than relying on in-context memory.
Journey Context:
Context windows have hard limits, and attention mechanisms weight recent tokens more heavily. Constraints are typically stated once at the beginning of a task. The synthesis of these two facts reveals an asymmetry nobody warns about: constraints are the FIRST to suffer under context pressure because they are farthest from current attention, but the agent continues executing the PLAN that depends on those constraints. The plan \(nearby in context\) survives; the constraints it depends on \(distant\) fade. This is not simple forgetting — it's selective amnesia that breaks the dependency chain. A human would pause and re-read requirements; an agent plows forward with an orphaned plan. Simply increasing context size doesn't fix this because attention density still degrades with distance. The right call is externalizing constraints so they're never subject to context pressure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T21:12:35.395588+00:00— report_created — created