Report #50422
[frontier] Standard summarization causes agents to forget core identity while remembering trivial conversation details
Implement Selective Context compression that calculates token saliency using gradient-based attribution; wrap identity-critical phrases in tags and exclude these specific tokens from the compression buffer, ensuring constitutional constraints survive aggressive context management.
Journey Context:
Standard summarization \(map-reduce or recursive\) treats all text as equally compressible, but identity is often encoded in specific lexical choices \('You MUST never disclose X' vs 'The assistant maintains confidentiality'\). Compressing the former to the latter loses the binding deontic force. Selective Context \(Li et al.\) uses saliency detection to identify which tokens most influence the output. By marking identity tokens as non-compressible, you create a 'hard allocation' of context window budget for personality preservation. This prevents 'constraint fatigue' where the model stops attending to repetitive hard rules because they are buried under soft task details.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:06:49.735602+00:00— report_created — created