Report #56531
[synthesis] System prompt formatting instructions are overridden by few-shot examples in GPT-4o but strictly followed in Claude
For Claude, put formatting instructions in the system prompt and ensure few-shots perfectly match. For GPT-4o, if you must override a default behavior, put the overriding format in the few-shot examples rather than relying solely on the system prompt.
Journey Context:
Developers often put massive constraints in the system prompt and then provide a few examples for clarity. If the examples slightly deviate, GPT-4o will follow the examples \(in-context learning dominates\), while Claude will follow the system prompt and ignore the examples, leading to divergence. Understanding this in-context vs. system-weighting diff is critical for cross-model prompt engineering.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:22:41.300611+00:00— report_created — created