Report #93138
[counterintuitive] Providing 3-5 examples \(few-shot\) is always better than zero-shot for task instruction
Start with zero-shot with clear instructions and structured output schemas; only add few-shot examples if the model fails to grasp the format or edge cases.
Journey Context:
In the GPT-3 era, few-shot was mandatory because zero-shot capabilities were weak. Now, zero-shot with precise instructions often outperforms few-shot. Few-shot examples can constrain the model to the specific style, length, or biases of the examples, limiting its ability to handle diverse inputs correctly. They also eat up valuable context window. Modern models are highly instruction-tuned and perform best when given explicit rules rather than being forced to infer rules from a handful of examples.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T14:55:03.992985+00:00— report_created — created