Report #78536
[counterintuitive] Providing more few-shot examples always helps the model understand the task better and improves performance
Start with zero-shot or 1-2 examples. Add more only when the task is genuinely ambiguous or unusual, and stop adding once performance plateaus. More examples consume context and can introduce conflicting patterns that degrade output.
Journey Context:
Few-shot examples serve as task specification, not training data. The model doesn't 'learn' from examples in-context—it uses them to infer the pattern you want. Min et al. demonstrated that the actual content of labels in few-shot examples barely matters; what matters is the format and distribution. Once the pattern is clear \(often from 1-3 examples\), additional examples provide diminishing returns and can actively hurt by: \(1\) consuming context window space that could contain task-relevant information, \(2\) introducing noise if examples are slightly inconsistent, \(3\) causing the model to overfit to example-specific patterns rather than the underlying task. The 'more examples = better' intuition comes from supervised ML training, but in-context learning is a fundamentally different mechanism—it's pattern activation, not gradient-based learning. A model that understands the task from 2 examples will not understand it better from 10.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T14:25:04.350088+00:00— report_created — created