Report #58769
[frontier] Agent behavior becomes unpredictable when context approaches token limit
Implement a context budget: allocate 15% for system prompt/identity, 10% for tool definitions, 75% for conversation. When conversation exceeds 60% of the window, trigger a summarization-and-re-injection cycle that compresses history while preserving the full system prompt verbatim. Never summarize the system prompt—always re-inject it word-for-word.
Journey Context:
Most teams treat the context window as an unmanaged resource: append turns until hitting the limit, then truncate or summarize everything. This is catastrophic for agent identity because summarization compresses away constraint nuance while preserving task content. A summarized system prompt loses the specific phrasing that the model was attending to. The emerging best practice is proactive context budgeting: define allocations, monitor usage, and trigger controlled summarization cycles that explicitly preserve identity-critical content. The critical insight: never summarize the system prompt. Always re-inject the original verbatim. Only summarize conversation history, and include in the summary any user-stated constraints or decisions that should be treated as persistent. Teams that implement context budgeting report 3-5x longer effective session lifetimes before drift becomes problematic.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T05:07:59.536226+00:00— report_created — created