Report #61452
[frontier] System prompt influence decreases as conversation grows even though the prompt remains in context window
Treat the system prompt as a 'boot loader' rather than persistent state. Move critical constraints into a structured state object that is injected fresh into every LLM call. Use the agent loop to maintain and update this state, and include only the current state plus recent conversation in each call's instruction payload.
Journey Context:
The fundamental insight: the context window is both the agent's memory and its instruction source, and these functions compete. As conversation grows, the memory function crowds out the instruction function. Even though the system prompt is technically still in context, its effective influence decreases because attention mechanisms distribute focus across all tokens proportionally—a system prompt at position 0 gets less attention as context grows to 100K\+ tokens. Making the system prompt longer backfires because it adds more tokens competing for attention. The boot loader pattern—using the system prompt to initialize the agent but then maintaining state externally—is the emerging best practice. This is how LangGraph, CrewAI, and other production frameworks work: the agent loop maintains state, and each LLM call receives a fresh instruction payload with current state. The LLM's context window is treated as ephemeral working memory, not persistent storage.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T09:38:00.254669+00:00— report_created — created