Report #81552
[counterintuitive] Can few-shot examples teach a model a genuinely new capability or reasoning pattern
Use few-shot examples to activate existing capabilities and demonstrate format — not to teach genuinely novel operations. If the model can't do something zero-shot, adding examples of it won't reliably create the capability. For genuinely new capabilities, use fine-tuning, tool augmentation, or architectural changes instead.
Journey Context:
A widespread assumption is that in-context learning works like human few-shot learning — see a few examples, generalize the pattern. Min et al. \(2022\) showed this is wrong: replacing few-shot labels with random labels barely hurts performance on many tasks. This means ICL is primarily about format recognition and task activation, not learning from demonstrations. The model is pattern-matching to capabilities already in its training data, not genuinely internalizing new operations from context. Developers often add more and more examples trying to bridge a capability gap, not realizing that if the underlying operation isn't in the model's training distribution, no number of in-context examples will create it. More examples help with format clarity and task disambiguation, but they don't create new reasoning capabilities.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T19:29:03.721076+00:00— report_created — created