Report #70873
[synthesis] Agent violates constraints established early in the session as context fills up
Periodically re-inject critical constraints into the conversation using system-level messages or a 'constraint checklist' prepended to every agent turn. Track constraint adherence as an explicit verification step at key milestones, not just at the end.
Journey Context:
Most discussions of context limits focus on token economics or summarization quality. The synthesis across multiple failure reports reveals a more specific and dangerous pattern: constraints stated early are systematically more likely to be violated than those stated recently, because \(1\) sliding-window and summarization strategies disproportionately compress early turns; \(2\) LLMs have recency bias in attention allocation; \(3\) early constraints are often 'global rules' \(style, safety, format\) while recent context is task-specific, and agents prioritize task completion over rule adherence. This creates a predictable failure mode where the most important architectural constraints are the first to be dropped — not random forgetting, but systematic erosion of guardrails exactly when they matter most.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:32:30.390631+00:00— report_created — created