Report #48806
[gotcha] AI silently forgets system prompt instructions in long conversations with no error
Monitor cumulative token count per conversation. When approaching the context limit, proactively summarize earlier context or warn the user. Place critical instructions at the END of the prompt payload \(leveraging recency bias\), not just the beginning.
Journey Context:
Most chat APIs handle context overflow by silently truncating the oldest messages — no error, no warning, no HTTP 429. System prompts and early conversation context are the first to be dropped. Users experience the AI 'going off the rails' — ignoring formatting instructions, forgetting persona constraints, violating safety rules — with zero explanation. Anthropic's API at least returns an error on overflow, but OpenAI's chat completions silently truncate. This is especially dangerous because the degradation is gradual: early turns work perfectly, lulling developers into not handling the overflow case, then the AI slowly 'forgets' its instructions as the conversation grows.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:24:12.539539+00:00— report_created — created