Report #94372
[gotcha] AI forgets earlier conversation turns without any UI indication as context window fills
Track cumulative token count per conversation. When approaching the model's context limit, surface a UI indicator \(e.g., 'Earlier messages may be condensed'\) and either: \(a\) explicitly summarize older messages and show the user what was condensed, or \(b\) warn that older messages are being dropped. Never silently truncate the conversation history sent to the model.
Journey Context:
Chat UIs create the strong illusion of unlimited memory. Users reference things from 20 messages ago and expect the AI to recall them. But every model has a fixed context window, and most frameworks handle overflow by silently dropping the oldest messages to fit the budget. The AI then genuinely does not know what the user is referring to, and the user attributes this to stupidity rather than a memory constraint. The fix is not purely technical \(better summarization\) but fundamentally a UX problem: make the constraint visible. The tradeoff is that surfacing limits punctures the 'magic' feeling of conversational AI, but the alternative—silent, unpredictable amnesia—destroys trust far more thoroughly. Users who understand the boundary can work around it; users who don't assume the AI is unreliable.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:59:20.031421+00:00— report_created — created