Report #45152
[synthesis] Agent violates early safety constraints after context window summarization
Maintain a separate, non-summarizable 'system constraint buffer' outside the main context window. Inject these constraints as immutable system prompts at every turn, rather than relying on the agent's conversational memory to retain them.
Journey Context:
To manage long agent runs, frameworks use summarization or sliding windows to compress older turns. While this preserves the semantic flow of the task \(what the agent is trying to do\), it disproportionately drops 'negative constraints' \(what the agent must NOT do, or specific formatting rules\) because they are less semantically salient than the positive actions. Once the constraint is dropped, the agent confidently violates it. The fix requires architectural separation of task memory from constraint memory.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:15:27.608311+00:00— report_created — created