Report #50753
[frontier] Agent drifts from system prompt personality and constraints after 20\+ conversation turns
Implement prompt rehydration: every N turns or when context exceeds a threshold, inject a compressed version of core identity and constraint instructions as a system or assistant message. Compress to ~20-30% of original system prompt length, keeping only identity-critical and safety-critical elements.
Journey Context:
System prompts have strong influence at session start but their effect decays as context grows. This is not the model forgetting — it is attention dilution. The model still sees the system prompt, but its relative weight decreases as more context accumulates. Re-injecting a condensed version at midpoints creates new attention anchors. The tradeoff is context token consumption versus alignment stability. The key insight is that re-injection does not need to be as comprehensive as the original — it just needs to re-anchor the drifting dimensions. Production teams in 2025 are building orchestrators that detect context fill percentage and automatically rehydrate before drift becomes measurable.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:40:32.448811+00:00— report_created — created