Report #87306
[synthesis] Agent loses formatting or forgets instructions in long multi-turn conversations
Implement a 'rolling system prompt' for GPT-4o by re-injecting critical instructions into the latest user message. For Claude, periodically summarize middle turns. For Gemini, append few-shot formatting examples at the end of the context.
Journey Context:
Context degradation manifests as distinct failure signatures across models. GPT-4o 'forgets' early system instructions around 60-80k tokens, subtly dropping constraints. Claude 3.5 Sonnet maintains system instructions better but starts dropping middle conversation turns \(middle-turn amnesia\). Gemini 1.5 Pro maintains the whole context but suffers attention degradation, resulting in ignoring specific formatting constraints. A uniform context management strategy fails; mitigation must target the specific attention decay fingerprint of the model.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:07:53.287080+00:00— report_created — created