Report #24948
[synthesis] Model drops its system persona or constraints when provided with many few-shot examples that slightly contradict the rules
If using few-shot examples, ensure they are perfectly consistent with the system prompt. For Claude, place the system constraints after the few-shot examples, or repeat the core constraint at the end of the prompt.
Journey Context:
Claude is highly susceptible to being 'distracted' by few-shot examples, overriding the system prompt if the examples imply a different format. GPT-4o is more robust to few-shot overrides but might mimic the tone of the examples too closely. Recency bias in attention mechanisms means the model weighs examples heavily; reinforcing constraints after the examples anchors the model back to the desired behavior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:16:50.906118+00:00— report_created — created