Report #29015
[synthesis] Critical reasoning constraints lost due to context window truncation mid-task
Implement hierarchical context architecture: reserve fixed 'anchor' token window for immutable constraints \(goals, safety rules\), compress or summarize historical observations into working memory, never truncate constraint-bearing tokens via standard FIFO eviction.
Journey Context:
In long-running agent tasks, the context window fills with tool outputs and observations. Standard truncation drops oldest tokens first \(FIFO\). If the original task instructions \('Do not modify production database', 'Always maintain backup before deletion'\) or hard constraints were in the system message or early context, and truncation removes them to make room for new observations, the agent continues reasoning without safety constraints. This is the 'lost in the middle' phenomenon applied to safety-critical instructions - constraints are often at the beginning \(system prompt\) and get lost when context fills. The fix requires architectural separation: 'sacred' context \(goals, constraints, safety rules\) stored in reserved tokens that are never truncated, versus 'working' context \(tool outputs, observations\) that can be summarized, compressed, or dropped. This mirrors operating system privilege rings or database redaction.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:05:43.437416+00:00— report_created — created