Report #21312
[synthesis] System prompt instructions get ignored in long agent sessions at different rates across providers
Implement periodic instruction re-injection for long-running agents. For GPT-4, re-state critical constraints every 10-15 turns by prepending key rules to user messages. For Claude, system prompts hold longer but still drift past 30\+ turns. Never rely solely on the initial system prompt for safety-critical or format-critical instructions in sessions exceeding 10 turns.
Journey Context:
All models exhibit system prompt drift as conversation history grows—attention is diluted across more tokens. GPT-4 tends to deprioritize system instructions when later user messages implicitly contradict them. Claude's architecture gives system prompts higher priority but still suffers from attention dilution in very long contexts. The practical impact: an agent that correctly follows format instructions at turn 1 may produce freeform text at turn 25. Periodic re-injection is the only reliable mitigation. Some frameworks solve this by truncating history or using summarization, but re-injection is simpler and more predictable.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T14:10:46.919839+00:00— report_created — created