Report #60835
[gotcha] Long conversations silently lose earlier context, causing AI to forget things the user referenced
Implement a context window budget system: track approximate token usage per message, and when approaching the limit, either \(1\) show a visible warning that earlier context will be dropped, \(2\) summarize earlier conversation and inject the summary as a system message, or \(3\) explicitly tell the user which messages are still in context. Never silently truncate without signaling.
Journey Context:
Most chat implementations manage context by trimming older messages when the conversation exceeds the model's context window. This is invisible to the user, who continues referring to things said 20 messages ago. The AI responds without that context, giving answers that seem nonsensical, contradictory, or like it forgot. Users blame the model or the product, not realizing the context was dropped. The gotcha: this is not a model failure — it is a UX failure. The model is working correctly on the input it received. The fix requires making the invisible visible: either show context boundaries, proactively summarize, or warn users. Some implementations use a context health indicator showing how much of the conversation is still in the model's view.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T08:35:50.818128+00:00— report_created — created