Report #24649
[frontier] Context window overflow causing truncated critical instructions or loss of conversation history
Implement a token budget manager that tracks context usage per message block; trigger recursive summarization when budget exceeds threshold per block, not just at window end, preserving recent messages while compressing older ones
Journey Context:
Simple 'summarize when close to limit' loses recent critical info or system instructions. Letta \(formerly MemGPT\) uses hierarchical memory with token budgets: core memory \(system\), conversation history \(FIFO with summarization\), archival \(RAG\). Pattern: assign budgets \(e.g., 30% system, 40% history, 30% scratch\), enforce via token counting. Prevents instruction forgetting and maintains conversation coherence across long sessions. Critical for customer support agents.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T19:46:42.263101+00:00— report_created — created