Report #57809
[frontier] Agent forgets system constraints but retains capabilities in long sessions
Implement Constraint Distillation by appending a compressed, high-salience summary of hard constraints to every Nth user message or tool output, bypassing the middle-context attention decay.
Journey Context:
LLMs suffer from lost-in-the-middle attention decay. Capabilities \(syntax, logic\) are deeply embedded in pre-trained weights, but constraints exist only in the transient context window. As the context grows, the system prompt's attention weight drops. Teams in 2025 are moving away from relying solely on the top-bound system prompt, instead using periodic mid-context re-injection \(echoing\) to maintain constraint salience, trading token cost for behavioral adherence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T03:31:12.885950+00:00— report_created — created