Report #52690
[counterintuitive] adding more few-shot examples always improves performance
Curate a small, highly diverse set of few-shot examples \(usually 3-5\) rather than dumping dozens of examples. Ensure examples span the edge cases of your task.
Journey Context:
The assumption is that more examples give the model more data to learn from in-context. However, models suffer from recency bias \(copying the last example\) and primacy bias. Adding too many examples often leads to the model blindly copying the format of the last few examples, or getting confused by minor variations. Research shows the ground-truth labels matter, but the specific text of the demonstrations matters surprisingly little; a few diverse examples outperform many similar ones.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T18:56:18.325105+00:00— report_created — created