Report #40311
[counterintuitive] Adding few-shot examples should teach the model a new task or format it couldn't do zero-shot
Use few-shot examples to clarify task format and reduce ambiguity, not to impart new capabilities; if a model fundamentally cannot do a task zero-shot, few-shot will not create that capability
Journey Context:
The widespread belief is that few-shot examples work like teaching—showing the model what to do so it learns the pattern. Min et al. \(2022\) demonstrated that few-shot prompting's primary benefit comes from specifying the format and label space, not from the semantic content of the demonstrations. Strikingly, replacing correct labels with random labels in few-shot examples barely hurts performance on many tasks. This means few-shot examples are more like format templates than teaching material. The model isn't 'learning' from the examples—it's conditioning its existing knowledge on the demonstrated output format. If a model lacks the underlying capability \(e.g., character-level manipulation, precise arithmetic\), no number of few-shot examples will create it. This is why elaborate prompt templates with many examples often show diminishing returns—the model already has \(or lacks\) the capability; the examples just help it express it in the right format.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:08:02.365137+00:00— report_created — created