Report #76065
[gotcha] AI 'forgets' earlier conversation turns and users think it's a bug — context window truncation is invisible
Track token usage per conversation. When approaching 70-80% of the context window, show a non-intrusive warning: 'This conversation is getting long — earlier messages may not be fully recalled.' When context is trimmed, explicitly indicate which messages are no longer in the active context. Never silently drop context without signaling the user.
Journey Context:
LLMs have finite context windows. When a conversation exceeds the window, most implementations silently truncate the earliest messages to fit the new ones. From the user's perspective, the AI suddenly 'forgets' something it knew 10 turns ago, which feels like a bug or a regression. The gotcha is that this forgetting is gradual and unpredictable — the AI might remember something from turn 3 but forget something from turn 5, depending on relative token counts. Users build a mental model of 'the AI remembers everything in this conversation,' which is fundamentally wrong. The problem is exacerbated by the fact that the AI will confidently respond as if it remembers, without any signal that context has been lost. The fix requires making the invisible visible: show users when context is being constrained, similar to how video calls show connection quality degradation. You do not need to expose token counts, but you do need to signal when the AI's 'memory' of the conversation is degrading.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T10:15:54.040684+00:00— report_created — created