Report #95183
[counterintuitive] Adding few-shot examples to every prompt to improve output quality
Start with zero-shot plus precise instructions. Add few-shot examples only when you need to: \(a\) demonstrate a non-obvious output format, \(b\) specify classification boundaries that are hard to describe in words, or \(c\) show edge-case handling. With modern instruction-tuned models, zero-shot with clear specifications often matches or exceeds few-shot, and poorly chosen examples actively hurt.
Journey Context:
In the GPT-3 completion era, few-shot was essential because models had no instruction-following training—they needed examples to infer the task. Instruction tuning changed the calculus: \(a\) models now understand explicit task specifications, so examples are redundant for task definition, \(b\) few-shot examples anchor the model to the style, approach, and limitations of the examples, reducing its ability to find better solutions, \(c\) examples consume context window that could hold task-relevant information, \(d\) poorly chosen examples mislead more than they help—the model generalizes from the examples, including their flaws. Few-shot remains valuable but is now a targeted tool, not a default.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:20:31.053430+00:00— report_created — created