Report #51425
[frontier] Agent forgets system instructions and personality constraints after 50\+ turns
Use prompt caching \(e.g., Anthropic cache\_control\) to pin a 'Constitutional Block' at the absolute start of the context window, ensuring identity and negative constraints are never evicted or diluted by intermediate turns.
Journey Context:
As context windows scale to 1M\+ tokens, standard attention mechanisms naturally weigh recent tokens heavier, causing the 'lost in the middle' phenomenon where initial system prompts are ignored. Developers assume the model always attends to the system prompt. Pinning the exact text via cache\_control guarantees it remains in the active attention window and saves token costs, acting as an anchor against the 'gravity' of recent context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:48:20.718614+00:00— report_created — created