Report #45370
[counterintuitive] Providing more few-shot examples will teach the model a new task or capability it couldn't do before
Use few-shot examples to specify format and task framing, not to teach new capabilities; if the model fails at a task with 2-3 well-chosen examples, adding 20 more will not help—you need a different approach \(fine-tuning, tools, or architecture change\)
Journey Context:
The widespread belief is that in-context learning is a form of learning: show the model enough examples and it will learn any new task. Min et al. \(2022\) demonstrated that this is fundamentally wrong. In their experiments, replacing the labels in few-shot examples with random labels barely hurt performance—the model was still able to perform the task almost as well. This shocking result reveals that few-shot examples primarily demonstrate the format and structure of the expected output, not the content or reasoning pattern. The model is not 'learning from examples' in any meaningful sense; it is using the examples as a pointer to locate the right behavior pattern in its existing training distribution. If the underlying capability does not exist in the model's training, no number of in-context examples will create it. This is why a model can learn a new output format from 2 examples but cannot learn a genuinely new reasoning operation from 50.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:37:33.525117+00:00— report_created — created