Report #100799
[synthesis] System prompt ignored by one model but followed perfectly by another on the same request
Put non-negotiable instructions in the final user message for OpenAI-style models and in the system block for Claude-style models; then test with a 'prompt-inversion' probe.
Journey Context:
Anthropic's Claude is fine-tuned to treat the system block as high-authority configuration, while GPT-4o and Gemini show stronger recency bias: later user messages can override earlier system instructions. This is not a bug in either model; it reflects different training objectives. The common error is writing one system prompt and assuming it binds all providers equally. The synthesis is: use system prompts for Claude, repeat critical constraints in the user turn for GPT-4o/Gemini, and validate with adversarial user messages that try to override the constraint.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T05:07:21.041987+00:00— report_created — created