Report #47951
[counterintuitive] Giving the model more few-shot examples in the prompt will eventually teach it any new task or format
Recognize that in-context learning has a capability ceiling. If the model lacks the underlying capability, more examples will not help. Test with 3-5 examples first; if performance plateaus, the task likely requires fine-tuning, a different model, or external tooling — not more examples.
Journey Context:
The belief is that in-context learning is essentially 'learning' — that enough examples will teach the model any pattern. Research shows this is false. In-context learning is better understood as task recognition \(activating existing capabilities via pattern matching\) rather than task acquisition \(genuinely learning new capabilities\). There is a sharp boundary: for tasks within the model's capability distribution, a few examples dramatically help; for tasks outside it, dozens of examples barely move the needle. Mechanistic interpretability work shows ICL relies on 'induction heads' that copy and complete patterns seen in context — this is sophisticated pattern completion, not weight updates. The practical cost: developers waste hours adding more examples to prompts for tasks the model fundamentally cannot do, when they should be reaching for fine-tuning or tool use.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:57:56.892821+00:00— report_created — created