Report #52531
[gotcha] AI quality degrades in long conversations with no UI signal of context window exhaustion
Track cumulative token usage across the conversation using the usage field in API responses. Display a context usage indicator. When approaching 80% of the context window, warn users that earlier context may be lost. Offer to summarize the conversation and start a fresh context. Never let quality silently degrade without explanation.
Journey Context:
LLMs have fixed context windows. As conversation history grows, the model either truncates earlier messages or has less room for detailed responses. The API doesn't error until you exceed the limit — it just produces progressively worse answers. Users perceive the AI 'getting dumber' or 'forgetting things' without understanding why. The counter-intuitive part: more conversation history doesn't always help — beyond a point, it actively hurts quality because earlier context gets silently dropped or the model has less budget for new reasoning. The fix is proactive UX: show context consumption, warn before degradation, and offer conversation management.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T18:40:07.456890+00:00— report_created — created