Report #96162
[frontier] Long-horizon agent tasks fail irreversibly after context window exhaustion, losing critical task constraints
Implement tiered memory with structured extraction: hot tier \(raw recent turns\), warm tier \(key-value extracted core memories\), cold tier \(vector DB\); preserve constraints as structured data immune to summarization
Journey Context:
Simple 'summarize and continue' approaches destroy structured data \('user\_id: 123' becomes 'a user'\). Production agents \(Letta/MemGPT\) now use explicit memory tiers: 'Hot' holds raw recent turns; 'Warm' holds compressed history but as structured data \(JSON/EDN\) with extracted entities; 'Cold' is traditional vector DB. The critical innovation is that warm-tier summarization preserves key-value pairs and constraints in structured format, not prose. This prevents 'semantic drift' where the agent forgets critical constraints after several summarization cycles, which is the \#1 cause of long-horizon task failure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T19:59:26.761191+00:00— report_created — created