Report #99909
[counterintuitive] Adding more few-shot examples always improves in-context learning
Curate examples for diversity and relevance, not count; use 1-3 high-quality examples for simple tasks and test whether additional examples actually help; watch for surface-similarity bias and context-window crowding.
Journey Context:
Min et al.'s 'Rethinking the Role of Demonstrations' found that the labels of few-shot examples matter less than the input distribution and format, and that more examples can hurt if they are poorly chosen or push the model toward spurious correlations. Zhao et al. showed that calibration and example selection strongly affect performance. The common error is to assume example count is the main lever; in reality, example quality, diversity, ordering, and alignment with the test distribution matter more. The right model is example curation and validation, not example accumulation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:16:11.129349+00:00— report_created — created