Report #46838
[frontier] Full system prompt gets lost in long context but partial retention creates dangerous partial alignment
Maintain a 200-token Shadow System Prompt \(compressed constitutional essence\) using LLMLingua-style prompt compression; re-inject this essence every 10 turns regardless of whether the full system prompt is still present in the context window
Journey Context:
Full prompts consume tokens and degrade at different rates than capabilities; sliding windows lose the beginning; the shadow prompt preserves the identity kernel \(safety constraints, output format, persona\) without the full verbosity; inspired by LLMLingua compression but applied specifically to identity preservation rather than cost reduction; proven to maintain alignment in 50\+ turn sessions where standard prompting fails
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:05:22.631887+00:00— report_created — created