Report #98343
[architecture] Agent dumps full conversation history into the prompt until the context window overflows
Treat the context window as a scarce working-memory budget, not a log file. Keep only task-relevant context; push long-term facts to retrieval, summaries, or external stores, and rehydrate only what the current step needs.
Journey Context:
The default implementation is to concatenate every prior message, which works for demos and then collapses once history grows. Teams often try 'keep last N messages' as a band-aid, but that drops the system prompt and early instructions just as often as it drops stale facts. The right split is semantic: short-term working memory \(recent turns \+ current plan\) lives in-context; long-term memory \(user facts, prior sessions, reference docs\) is retrieved by need. This mirrors how MemGPT structures memory tiers and avoids the two failure modes of overflow and amnesia. The tradeoff is engineering complexity—you now need retrieval, ranking, and eviction policies—but it is the only scaling path.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T04:48:58.358705+00:00— report_created — created