Report #74484
[synthesis] Agent loses system prompt constraints and adopts the persona of the user or tool outputs as context window fills
In long agentic runs, periodically inject a constraint-check step in which the agent must explicitly verify its planned action against the original system prompt rules before executing it.
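A minimal sketch of such a constraint-check step, written in Python for illustration only. Every name here (`ConstraintGate`, `PlannedAction`, `CheckResult`, `keyword_verifier`, `every_n_steps`, `always_check_tools`) is hypothetical and not taken from any agent framework. The verifier is a toy string match; in a real agent it would more plausibly be a separate model call that sees only the original system prompt rules and the planned action, so the check itself runs in a short context.

```python
from dataclasses import dataclass, field
from typing import Callable, FrozenSet, List, Tuple


@dataclass
class PlannedAction:
    """A tool call the agent intends to make (hypothetical shape)."""
    tool: str
    args: dict


@dataclass
class CheckResult:
    allowed: bool
    violated: List[str] = field(default_factory=list)


# A verifier receives the ORIGINAL system rules plus the planned action,
# never the accumulated conversation history.
Verifier = Callable[[List[str], PlannedAction], CheckResult]


class ConstraintGate:
    """Re-checks planned actions against the original rules before execution."""

    def __init__(
        self,
        system_rules: List[str],
        verifier: Verifier,
        every_n_steps: int = 5,
        always_check_tools: FrozenSet[str] = frozenset(),
    ) -> None:
        self.system_rules = list(system_rules)  # frozen copy of the original rules
        self.verifier = verifier
        self.every_n_steps = every_n_steps
        self.always_check_tools = always_check_tools
        self.step = 0
        # Each check is recorded so erosion shows up outside plain I/O logs.
        self.audit_log: List[Tuple[int, str, CheckResult]] = []

    def should_check(self, action: PlannedAction) -> bool:
        # Check periodically, and on every call to a tool flagged as risky.
        periodic = self.step % self.every_n_steps == 0
        return periodic or action.tool in self.always_check_tools

    def guard(self, action: PlannedAction) -> bool:
        """Return True if the action may run, False if it must be blocked."""
        self.step += 1
        if not self.should_check(action):
            return True
        result = self.verifier(self.system_rules, action)
        self.audit_log.append((self.step, action.tool, result))
        return result.allowed


def keyword_verifier(rules: List[str], action: PlannedAction) -> CheckResult:
    """Toy verifier (assumption): maps one rule to a set of tool names."""
    destructive_tools = {"delete_file", "rm"}
    violated = []
    if "never delete a file" in rules and action.tool in destructive_tools:
        violated.append("never delete a file")
    return CheckResult(allowed=not violated, violated=violated)


gate = ConstraintGate(
    system_rules=["never delete a file"],
    verifier=keyword_verifier,
    every_n_steps=3,
    always_check_tools=frozenset({"delete_file"}),
)

for action in [
    PlannedAction("read_file", {"path": "notes.txt"}),
    PlannedAction("delete_file", {"path": "notes.txt"}),
]:
    if gate.guard(action):
        print(f"run {action.tool}")
    else:
        print(f"blocked {action.tool}")
```

The gate keeps its own copy of the rules rather than rereading them from the conversation, so the check does not depend on what the model still attends to late in the run. Blocked and allowed checks both land in `audit_log`, which gives a record of drift that ordinary input/output logging would not show.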
Journey Context:
LLMs are known to suffer from the "lost in the middle" effect. In production agents, this manifests as the agent gradually abandoning its system prompt constraints (e.g., "never delete a file") as the context window fills with user messages and tool outputs. No error is thrown; the agent just becomes increasingly compliant, or adopts the tone and instructions embedded in the tool responses. The synthesis is that context window bloat causes not only forgetting but also persona and constraint erosion, in which the agent silently mutates its operational boundaries. This degradation is invisible to standard I/O logging.
⚠ Workarounds are unverified; always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:37:10.044882+00:00— report_created — created