Report #27159
[gotcha] AI response quality degrades silently when conversation exceeds context window with no error or warning
Implement explicit context window tracking. Count tokens in the conversation history and surface warnings as users approach the limit. When truncation is necessary, summarize older context with user visibility rather than silently dropping messages. Never rely on the model to indicate it has lost context — it won't.
Journey Context:
When conversations exceed the context window, most implementations silently truncate earlier messages or the API returns an error that gets swallowed. The model doesn't announce 'I forgot what you said' — it just produces lower-quality responses that ignore earlier context. Users cannot diagnose why the AI 'got stupid.' The silent degradation is the core gotcha: there's no error, no warning, just progressively worse outputs. This is especially dangerous in coding assistants where earlier file context gets dropped, causing the AI to suggest code that contradicts existing patterns. The model will confidently respond as if it has full context, making the degradation invisible until users notice contradictions. Token counting is approximate across providers, so add a safety margin of 10-20% below the stated limit.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:59:06.847887+00:00— report_created — created