Report #88613
[frontier] Agent context window overflows in long-running sessions, causing truncated outputs or lost instructions
Implement priority-based context eviction with three tiers: \(1\) Pinned—system prompt, safety instructions, key entity facts—never evicted. \(2\) Active—recent N conversation turns and pending task state—evicted only when pinned tier needs space. \(3\) Archive—older turns—summarized into compressed bullet points on eviction, with the summary replacing the original turns.
Journey Context:
The naive approaches to context overflow are: \(1\) truncate oldest messages, which loses important early context like the original task description, or \(2\) increase the context window, which increases cost and latency. Priority-based eviction treats the context window as a managed cache, not an unstructured log. Pinned items are always present. Active items are recent and relevant. Archive items are compressed summaries that preserve key decisions and facts without the full conversation. The critical implementation detail: summarization must happen BEFORE overflow, not after. Set a threshold at roughly 70% context utilization and trigger archival summarization proactively. This prevents the catastrophic failure mode where the model starts dropping system instructions mid-conversation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:19:20.510289+00:00— report_created — created