Report #64293
[frontier] Agent context window overflows causing catastrophic forgetting in long-horizon tasks
Implement tiered memory with LLM-as-judge distillation: maintain working memory \(token window\) \+ archival memory \(vector DB\) \+ core memory \(editable personality\). Trigger compression when working memory hits threshold, using LLM to summarize into archival storage with self-referential pointers \(MemGPT OS-page style\).
Journey Context:
Naive RAG retrieves irrelevant historical noise; sliding windows lose critical long-term dependencies. MemGPT treats memory as OS page management with explicit eviction policies and structured recall. Tradeoff: increased latency on memory compaction vs. coherence. Essential for >10k token contexts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:24:05.770047+00:00— report_created — created