Report #86935
[synthesis] Agent drops critical negative constraints from context due to middle-of-context truncation
Use constraint pinning: repeat each critical negation in the system prompt header and in a per-step verification hook, and never rely on a single mention in the middle of the context.
Journey Context:
When context approaches token limits, standard truncation or summarization algorithms often remove 'boring' negative constraints (e.g., 'DO NOT delete the production database') while preserving 'interesting' action items. The agent then confidently executes forbidden actions because the prohibition was silently dropped from working memory. Simple 'keep last N tokens' truncation exacerbates this by preserving recent steps but losing original constraints. The solution requires treating negative constraints as invariant axioms that must survive any compression, rather than ordinary content.
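A minimal sketch of constraint pinning, assuming a simple message-list context rather than any particular agent framework. The names `PinnedContext`, `max_history`, and `verify_step` are hypothetical and chosen only for illustration. The history budget is counted in messages instead of tokens, and the verification hook uses plain substring matching as a crude stand-in for a real policy check.

```python
from dataclasses import dataclass, field


@dataclass
class PinnedContext:
    """Context buffer whose pinned constraints are exempt from truncation."""

    pinned: list[str]                 # negative constraints, never dropped
    max_history: int = 4              # hypothetical budget, counted in messages
    history: list[str] = field(default_factory=list)

    def add(self, message: str) -> None:
        """Append a message, then truncate ordinary history only."""
        self.history.append(message)
        if self.max_history > 0:
            self.history = self.history[-self.max_history:]
        else:
            self.history = []

    def render(self) -> str:
        """Rebuild the prompt with every pinned constraint in the header."""
        header = [f"CONSTRAINT: {c}" for c in self.pinned]
        return "\n".join(header + self.history)


def verify_step(action: str, forbidden_terms: list[str]) -> bool:
    """Per-step hook: return False if the action contains a forbidden term.

    Substring matching is an assumption made for brevity; it will miss
    paraphrased or obfuscated actions.
    """
    lowered = action.lower()
    return not any(term.lower() in lowered for term in forbidden_terms)


if __name__ == "__main__":
    ctx = PinnedContext(
        pinned=["DO NOT delete the production database"],
        max_history=2,
    )
    for i in range(5):
        ctx.add(f"step {i}")
    print(ctx.render())
    print(verify_step("DROP DATABASE production", ["drop database"]))
```

The constraint is stored outside the truncated history and re-emitted on every render, so 'keep last N' truncation cannot remove it. The hook re-checks each proposed action independently of whatever the model still has in context.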
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:30:29.587684+00:00— report_created — created