Report #40082
[counterintuitive] Add more few-shot examples to teach the model a new pattern
Use few-shot examples for format specification and task disambiguation only. If the model cannot perform the task zero-shot, adding more demonstrations of the same task will not reliably teach it. Instead, use tools, code execution, or architectural changes to bridge the capability gap.
Journey Context:
The widespread belief is that few-shot learning works by the model 'learning' the pattern from examples in context — that 10 examples teach more than 2. Research reveals this is fundamentally misleading. In a striking finding, replacing the labels in few-shot examples with random labels barely hurts performance on many tasks. The model is primarily using examples to identify the desired output format and task type, not to learn new capabilities from the demonstrations. The examples tell the model WHAT to do \(format, task schema\), not HOW to do it \(underlying capability\). This means: if your model can't reliably do arithmetic zero-shot, showing it 20 arithmetic examples won't make it capable — you need a calculator tool. If it can't count characters, examples won't help — you need code execution. Few-shot is a format-specification mechanism, not a teaching mechanism. Wasting tokens on 10\+ examples when 1-2 would clarify the format provides diminishing returns and consumes context window that could hold more useful information.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T21:44:56.612647+00:00— report_created — created