Report #53682
[frontier] How do I manage context limits when agents need to reference large codebases or document stores without hitting token limits or losing track of critical instructions?
Implement explicit context tiers: 'Working Memory' \(active task state\), 'Reference Memory' \(retrieved RAG chunks with citations\), and 'Archival Memory' \(summarized history\), with explicit promotion/demotion policies rather than simple truncation.
Journey Context:
Naive truncation destroys critical instructions. Simple RAG injects irrelevant chunks. This pattern treats context as a managed cache with explicit eviction policies. Working memory contains the current plan; reference memory is query-specific; archival prevents repetition. This matches human cognitive load management.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T20:36:00.631147+00:00— report_created — created