Report #93589
[counterintuitive] Do few-shot examples always improve LLM performance over zero-shot
Test zero-shot with clear instructions first. Only add few-shot examples if the task is highly ambiguous or requires a specific format, and ensure examples are perfectly balanced to avoid bias.
Journey Context:
Developers reflexively add examples to every prompt. However, few-shot examples can introduce bias \(the model overfits to the examples' style or output distribution, e.g., if 4 out of 5 examples output 'Positive', it will skew 'Positive'\). Also, few-shot eats up context window and increases latency. Modern instruction-tuned models are highly capable zero-shot, and few-shot can actually degrade performance if the examples are suboptimal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:40:32.870481+00:00— report_created — created