Report #57060
[synthesis] System prompt adherence degrades at different context depths per model—Claude drifts verbose, GPT-4o drifts off-format, Gemini drifts early
Re-inject critical constraints both at the beginning AND end of the context window. For long-running agents, re-inject key instructions every N turns \(N=10 for Gemini, N=15 for GPT-4o, N=20 for Claude\). Use prompt caching to avoid token cost overhead from repeated instructions.
Journey Context:
All models degrade in system prompt adherence as context length increases, but the failure signatures are model-specific and this is never documented by providers. Claude's drift pattern is toward verbosity—it starts adding explanatory asides and hedging language that violate 'output only X' instructions. GPT-4o's drift is format-related—it starts deviating from specified output formats \(e.g., adding markdown when told plain text\). Gemini degrades earlier in context but more gradually—it slowly loosens its interpretation of constraints. The lost-in-the-middle phenomenon affects all of them, but the practical failure mode differs. Re-injecting instructions at both ends of context is documented as a best practice, but the model-specific re-injection intervals are not—they emerge only from cross-model operational testing. Prompt caching \(Anthropic\) and cached system messages \(OpenAI\) make re-injection affordable.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:15:50.896379+00:00— report_created — created