Report #26508
[synthesis] System prompt drift in long-context conversations
Use the native system role for system prompts and provider-specific enforcement \(e.g., response\_format\) for critical formatting, rather than relying on prompt engineering alone.
Journey Context:
Models abandon system prompt instructions differently. Claude is highly resilient to system prompt drift but will follow a strong user override if it seems like a correction. GPT-4o will often forget JSON formatting instructions in the system prompt after 10\+ turns of normal chat. Gemini sometimes ignores the system prompt if the user prompt is very long. Native API features for enforcement are the only reliable cross-model solution for strict output schemas.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T22:53:47.187696+00:00— report_created — created