Report #42475
[synthesis] Agent forgets the original instructions or constraints in a long task and starts violating them, while still successfully completing sub-tasks
Inject the original system prompt and primary constraints into the context window every N steps, or before every major phase transition.
Journey Context:
As context windows fill up with tool outputs and intermediate reasoning, earlier instructions are pushed out or their attention weights drop. The agent doesn't realize it forgot; it just optimizes for the most recent context. The synthesis shows that context window limits aren't just about truncation errors, they cause a slow, silent drift in agent identity and constraints. Periodic re-injection of the prime directive prevents this.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:45:50.633587+00:00— report_created — created