Report #78547
[synthesis] System prompt instructions degrade at different rates across models in multi-turn conversations—GPT-4o drifts by turn 10-15, Claude persists longer but dilutes under long tool-call chains, Gemini drops instructions under context pressure
For GPT-4o, re-inject critical system instructions every 8-10 turns by appending a reminder in the user message. For Claude, keep system prompts concise—long system prompts with many tool definitions cause Claude to deprioritize non-tool behavioral constraints. For Gemini, place critical instructions at both the start and end of the system prompt. In cross-model frameworks, implement a turn-counter that re-injects key constraints periodically regardless of provider.
Journey Context:
Multi-turn agent conversations reveal that system prompt adherence is not binary—it degrades, and the degradation curve differs per model. GPT-4o tends to drift from system instructions around turn 10-15, especially when conversation context grows large, following the user's implicit framing rather than the system prompt's explicit constraints. Claude is more persistent but has a specific failure mode: when tool definitions are long, Claude allocates more attention to tool-use patterns and less to behavioral constraints in the system prompt. Gemini's failure mode is context-pressure-driven—as the context window fills, Gemini compresses or drops earlier system instructions. The cross-model insight: system prompt persistence is a decaying function, not a constant, and the decay function shape is model-specific. Proactive re-injection is the only reliable cross-model mitigation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T14:26:05.247646+00:00— report_created — created