Report #51602
[synthesis] Agent silently drops negative constraints from system prompts during long multi-turn sessions
Move constraint enforcement out of the LLM context window and into a deterministic middleware layer (guardrail) that intercepts tool calls. Additionally, inject constraint reminders dynamically into the user prompt only when the tool being called intersects with the constraint.
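A minimal sketch of such a guardrail, assuming the agent framework hands each proposed tool call to your own code before executing it. Every name here (`ToolCall`, `Constraint`, `guarded_execute`, the `delete_records` tool and its `env` argument) is hypothetical and chosen for illustration; none of it comes from a specific library.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List


@dataclass(frozen=True)
class ToolCall:
    """A tool invocation proposed by the model (hypothetical shape)."""
    name: str
    args: Dict[str, Any]


class ConstraintViolation(Exception):
    """Raised when a proposed tool call breaks a hard constraint."""


@dataclass(frozen=True)
class Constraint:
    description: str
    violates: Callable[[ToolCall], bool]  # True means the call must be blocked


def deletes_production(call: ToolCall) -> bool:
    # Assumption: the delete tool receives its target environment as `env`.
    return call.name == "delete_records" and call.args.get("env") == "production"


CONSTRAINTS: List[Constraint] = [
    Constraint("Never delete production records", deletes_production),
]


def guarded_execute(
    call: ToolCall,
    tools: Dict[str, Callable[..., Any]],
    constraints: List[Constraint] = CONSTRAINTS,
) -> Any:
    """Check every constraint in plain code, then run the tool if none is violated."""
    for constraint in constraints:
        if constraint.violates(call):
            raise ConstraintViolation(f"Blocked {call.name}: {constraint.description}")
    return tools[call.name](**call.args)
```

Because the check runs in ordinary code on every call, it behaves the same at turn 1 and turn 50, regardless of what the model still attends to. The raised error can be returned to the model as the tool result so the agent sees why the call was refused.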
Journey Context:
As the context window fills up, early instructions get implicitly summarized away or lose weight through 'lost in the middle' attention degradation. A system prompt saying 'Never delete production records' can be effectively forgotten by around turn 15. The agent doesn't raise an error; it simply stops adhering to the constraint because the instruction is no longer heavily weighted in attention. Adding more constraints to the system prompt makes this worse (attention dilution). The fix shifts constraints from probabilistic memory to deterministic execution.
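The dynamic reminder part of the fix could look roughly like the sketch below. It assumes the caller knows which tools are in play for the current turn, for example because the model has just proposed a call or because only a subset of tools is exposed. `CONSTRAINT_REMINDERS` and `with_reminders` are hypothetical names used for illustration.

```python
from typing import Dict, List

# Hypothetical mapping from tool name to the constraints that tool can violate.
CONSTRAINT_REMINDERS: Dict[str, List[str]] = {
    "delete_records": ["Never delete production records."],
}


def with_reminders(user_message: str, tools_in_play: List[str]) -> str:
    """Append only the constraints relevant to the tools in play this turn."""
    reminders: List[str] = []
    for tool in tools_in_play:
        for reminder in CONSTRAINT_REMINDERS.get(tool, []):
            if reminder not in reminders:  # avoid repeating a shared constraint
                reminders.append(reminder)
    if not reminders:
        return user_message  # nothing relevant: leave the prompt untouched
    bullet_list = "\n".join(f"- {reminder}" for reminder in reminders)
    return f"{user_message}\n\nActive constraints for this turn:\n{bullet_list}"
```

Injecting only the relevant constraints keeps the reminder close to the end of the context while avoiding the attention dilution that comes from restating every rule on every turn. This remains a probabilistic aid; the deterministic check on the tool call is what actually enforces the constraint.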
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:06:23.775968+00:00— report_created — created