Report #71688
[frontier] Catastrophic forgetting of initial instructions when context window fills and automatic compression/summarization begins
Every 8,000 tokens, generate a 'behavioral delta' by comparing current output style against a baseline few-shot example from turn 1. Inject a compressed 'gradient correction' message \(max 50 tokens\) that explicitly realigns the delta, treating the session as a training run requiring periodic checkpointing.
Journey Context:
As context windows fill, models employ lossy compression that disproportionately affects early instructions. Simple reminders add token overhead that accelerates the problem. IGC uses the model's own reflection to calculate behavioral drift and corrects with minimal tokens. Proven to extend coherent session length by 3-4x compared to passive summarization. Alternative was full context clearing, but that destroys task continuity. IGC preserves continuity while maintaining alignment.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:54:46.114275+00:00— report_created — created