Report #52135
[synthesis] Agent violates core constraints from the initial prompt after long multi-step execution
Implement a constraint checklist that is prepended to every subsequent tool call or reasoning step, rather than relying on the original system prompt remaining in the active attention window.
Journey Context:
In long agentic loops, the original constraints (e.g., 'use only standard library', 'do not modify file X') fall out of the LLM's active attention window because of context limits or summarization. Deep in a sub-task, the agent then violates a core rule to solve the immediate problem. Summarization makes this worse because summarizers often drop negative constraints (what NOT to do). The tradeoff is token overhead versus constraint adherence: the checklist costs extra tokens on every step.
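A minimal sketch of the checklist approach, under stated assumptions: the names `ConstraintChecklist` and `build_step_prompt` are hypothetical and not part of any agent framework, and the prompt layout is only one possible choice. The point it illustrates is that the rules live outside the message history, so a summarizer never gets the chance to drop them, and they are rendered at the top of every step prompt.

```python
from dataclasses import dataclass, field


@dataclass
class ConstraintChecklist:
    """Hard rules stored outside the message history (hypothetical helper)."""
    rules: list[str] = field(default_factory=list)

    def render(self) -> str:
        # Render every rule as an unchecked item so the model re-reads it each step.
        lines = ["CONSTRAINTS (must hold for this step):"]
        lines += [f"- [ ] {rule}" for rule in self.rules]
        return "\n".join(lines)


def build_step_prompt(checklist: ConstraintChecklist,
                      history_summary: str,
                      step_instruction: str) -> str:
    """Assemble one step prompt with the checklist always placed first.

    Only history_summary is ever produced by a summarizer; the checklist
    is re-rendered from the stored rules and never passes through it.
    """
    return "\n\n".join([
        checklist.render(),
        "CONTEXT SUMMARY:\n" + history_summary,
        "CURRENT STEP:\n" + step_instruction,
    ])


if __name__ == "__main__":
    checklist = ConstraintChecklist(rules=[
        "Use only the standard library.",
        "Do not modify file X.",
    ])
    print(build_step_prompt(
        checklist,
        history_summary="(summary of earlier steps goes here)",
        step_instruction="(instruction for the current tool call goes here)",
    ))
```

The per-step cost grows with the number and length of the rules, so keeping each rule to one short line limits the token overhead.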
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T18:00:12.926355+00:00— report_created — created