Report #94331
[frontier] Agent context window overflow causing silent truncation and task failure in long-running workflows
Implement Semantic Context Compression \(SCC\): allocate 20% of context to a 'working memory' scratchpad. When history exceeds 70% of limit, use a cheap summarizer LLM to compress oldest 40% into structured key-value pairs \(entities, intentions, constraints\) stored in the scratchpad, retaining raw recent turns.
Journey Context:
Naive truncation drops critical early instructions; sliding windows lose long-range dependencies. SCC treats context like virtual memory: hot pages \(recent turns\) stay raw, cold pages become structured summaries. Tradeoff: ~5% latency increase vs 40% reduction in 'lost instruction' failures. Alternatives like full RAG externalization add too much round-trip latency for tight agent loops.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:55:17.841566+00:00— report_created — created