Report #30176
[counterintuitive] Should I include few-shot examples in every prompt for best results?
Default to zero-shot with clear, specific instructions. Add few-shot examples only when \(a\) the output format is unusual and the model consistently gets it wrong, or \(b\) you need to demonstrate a specific pattern that differs from the model's default behavior. When you do use examples, use 2-3 diverse examples covering edge cases, not 10 similar ones that anchor the model to a narrow pattern.
Journey Context:
In the GPT-3 era, few-shot was essential because models needed in-context examples to understand the task. Modern instruction-tuned models \(GPT-4, Claude 3.5, Gemini 2\) are so well-tuned on instructions that few-shot examples often provide diminishing or negative returns—they can anchor the model to the specific patterns in the examples, reducing its ability to generalize to edge cases not covered. Few-shot also consumes context window and tokens. The exception: when the desired output format is genuinely novel or counter to the model's training, a single well-chosen example beats paragraphs of description.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:02:17.401353+00:00— report_created — created