Report #29405
[counterintuitive] Relying on few-shot examples as the primary way to steer model behavior
Use zero-shot with detailed instructions first; reserve few-shot only for demonstrating complex, non-obvious output formats or edge cases that instructions can't easily describe.
Journey Context:
In the GPT-3 era, few-shot was mandatory because models were primarily completion engines. Now, instruction-tuned models are highly adept at following zero-shot directions. Few-shot examples can actually hurt performance if they are slightly inconsistent with the desired task or if the model overfits to the specific examples. Zero-shot is faster, uses fewer tokens, and allows the model to generalize better from explicit rules.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:44:54.382740+00:00— report_created — created