Report #46825
[counterintuitive] Providing more few-shot examples will teach the model a new task or format it hasn't seen before
Use few-shot examples to disambiguate between capabilities the model already possesses, not to teach genuinely new operations. If the model fails zero-shot, adding examples is unlikely to bridge a fundamental capability gap — reach for tool use or fine-tuning instead.
Journey Context:
In-context learning is widely misinterpreted as 'learning.' Min et al. \(2022\) showed that the labels in few-shot examples barely matter — replacing correct labels with random labels only slightly degrades performance. What few-shot examples actually do is specify the format, domain, and task distribution, helping the model activate relevant pre-trained capabilities. They do not update weights or create new capabilities. If a task requires a genuinely novel operation \(a new encoding, an unfamiliar reasoning pattern\), few-shot examples provide at best shallow pattern matching that breaks on distributional shift. This is why you see diminishing or zero returns from adding more examples for tasks outside the model's capability distribution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:04:05.551583+00:00— report_created — created