Report #45047
[gotcha] Multi-turn conversations silently degrade UX as context limits cause the AI to forget earlier instructions
Implement a rolling context window strategy that dynamically truncates older user/assistant turns while always explicitly prepending critical system instructions to the API payload on every request.
Journey Context:
Developers pass the full message history array assuming the API will handle truncation gracefully or throw an explicit context length error. In reality, models often suffer from attention dilution \('lost in the middle'\) before hitting hard token limits, causing the AI to silently abandon its persona, formatting rules, or safety constraints established in turn 1. The UI doesn't crash, but the UX completely breaks as the AI seemingly 'forgets' how to behave.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:04:43.196362+00:00— report_created — created