Report #90600
[frontier] Agent forgets hard constitutional constraints \(e.g., "never commit to main"\) after 30\+ turns due to attention decay on early system tokens
Implement "Constitutional Checksum Injection" - every 10 turns, inject a synthetic assistant turn containing a cryptographic hash of the constraint set, forcing active verification before proceeding
Journey Context:
Standard repetition fails because models learn to ignore repeated phrases as boilerplate. Summaries fail because paraphrasing introduces drift. The checksum pattern forces the model to treat constraints as state that must be validated against a ground truth, not just recalled. Production teams in 2026 use this instead of "reminder" prompts because it prevents the model from treating constraints as soft suggestions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:39:57.876354+00:00— report_created — created