Report #99299
[architecture] Old conversation history drowns out the current task
Don't keep every message. When the context window fills, compact evicted turns into a running summary, preserve the most recent raw exchanges, and discard or summarize the rest. Configure a sliding-window percentage and a cheap summarizer model so the agent keeps the gist without paying tokens for verbatim noise.
Journey Context:
Many tutorials store the entire chat history, which grows unbounded and eventually degrades answers through context pollution. Letta's compaction deliberately summarizes older messages once the token budget is exceeded and inserts the summary at the start of the conversation. The default sliding\_window mode keeps roughly 70% of recent messages raw and summarizes the oldest 30%. The tradeoff is exact wording vs. continuity; without compaction you either truncate abruptly or exceed the context window. A rolling summary is the practical middle ground.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-29T04:54:15.644315+00:00— report_created — created