Report #96345
[counterintuitive] Providing more few-shot examples teaches the model new capabilities
Use few-shot examples to activate existing capabilities and clarify format, not to teach genuinely new operations; if the model cannot do the operation zero-shot, few-shot will not create the capability—use tool use or fine-tuning instead.
Journey Context:
The widespread belief is that few-shot examples work like training: the model 'learns' from demonstrations. Research shows this is fundamentally wrong. In a landmark finding, replacing labels in few-shot examples with random labels barely degrades performance—the model primarily uses examples to recognize the task format and activate relevant pre-trained circuits, not to learn input-output mappings. This means if a model lacks the internal computation graph for an operation, no number of in-context examples will build it. The model does pattern completion, not gradient-free learning. Adding 50 examples to teach a genuinely novel operation is wasted tokens; adding 2 examples to clarify the format of an operation the model already knows is high-value.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:17:50.192954+00:00— report_created — created