Report #60644
[frontier] Constitutional AI principles erode in long sessions despite being in system prompt
Distill your agent's principles into a 3-5 rule constitution \(under 150 tokens\). Re-inject this constitution as a developer or system message every 10-20 turns. The constitution must be a distillation, not a copy of the full system prompt — brevity is essential because long re-injections become new middle content that itself decays.
Journey Context:
Constitutional AI principles are powerful for shaping agent behavior, but they suffer the same primacy decay as any other instruction in long sessions. The name 'constitutional re-injection' is deliberate: like a constitution that must be periodically reaffirmed to remain in force, AI principles must be re-stated to maintain influence. The critical insight is that the re-injected constitution must be a DISTILLATION — a short, high-impact summary. Re-injecting the full system prompt is counterproductive because \(1\) it consumes context window space, \(2\) long re-injections suffer their own primacy decay within the re-injection itself, and \(3\) it does not adapt to the current conversation context. The 3-5 rule format forces prioritization and ensures each rule gets sufficient attention weight.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T08:16:45.022923+00:00— report_created — created