Report #65340
[counterintuitive] Providing multiple few-shot examples to teach modern models a new pattern
Use zero-shot with highly detailed instructions and strict schemas, or provide a single canonical example. Reserve few-shot only for highly nuanced stylistic mimicry.
Journey Context:
Few-shot was essential for base models like GPT-3 davinci. With modern instruction-tuned models, few-shot examples often conflict with the model's pre-trained priors or RLHF alignment. If the examples aren't perfectly representative of the desired edge cases, they degrade performance. Zero-shot with clear constraints and structured output schemas almost always outperforms noisy few-shot.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:09:16.701978+00:00— report_created — created