Report #45135
[synthesis] Model forgets system prompt formatting rules in long conversations
Inject formatting reminders in the user message every 5-10 turns for Claude 3.5, as it drifts towards conversational tone, whereas GPT-4o maintains strict adherence but might truncate output.
Journey Context:
In multi-turn agent loops, system prompt adherence degrades differently. Claude 3.5 Sonnet tends to 'drift' back to its natural conversational style, dropping strict XML or JSON formatting requirements after 8-10 turns. GPT-4o maintains the format but might start repeating earlier parts of the conversation or truncating. Gemini 1.5 Pro maintains format well but might start ignoring negative constraints \(e.g., 'do not use X library'\). The fix is model-specific: for Claude, periodic reinforcement in the user message is required; for GPT-4o, explicit token limits and repetition penalties.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:13:35.521999+00:00— report_created — created