Report #61500
[counterintuitive] Should I add many few-shot examples to improve LLM accuracy?
Use 3-5 highly diverse, high-quality few-shot examples rather than dozens. Beyond a small number, the model can overweight the most recent examples and overfit to their surface patterns, which degrades performance on edge cases.
Journey Context:
Adding more few-shot examples seems like an easy way to boost accuracy, but LLMs can show a strong recency bias: given many examples, they tend to overfit to the patterns of the last few in the prompt and underweight the instruction and the earlier examples. Research suggests that the format of few-shot examples matters more than whether their ground-truth labels are correct, and that quality and diversity matter far more than quantity.
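As a rough illustration of the advice above, here is a minimal sketch that picks a small, label-balanced set of examples from a larger pool and assembles a prompt. The function names (select_diverse_examples, build_prompt), the "text"/"label" dictionary keys, and the "Input:/Label:" layout are assumptions made for this example, not part of any library, and label balance is only a simple stand-in for diversity.

```python
import random
from collections import defaultdict


def select_diverse_examples(pool, k=4, seed=0):
    """Pick up to k examples, cycling through labels so no label dominates.

    `pool` is a list of dicts with hypothetical keys "text" and "label".
    """
    by_label = defaultdict(list)
    for ex in pool:
        by_label[ex["label"]].append(ex)

    rng = random.Random(seed)  # fixed seed keeps the selection reproducible
    labels = sorted(by_label)
    for label in labels:
        rng.shuffle(by_label[label])

    picked = []
    # Round-robin over labels: consecutive picks come from different labels,
    # so the tail of the prompt is not a run of a single label.
    while len(picked) < k and any(by_label[label] for label in labels):
        for label in labels:
            if by_label[label] and len(picked) < k:
                picked.append(by_label[label].pop())
    return picked


def build_prompt(instruction, examples, query):
    """Assemble instruction, few-shot examples, and the query in one format."""
    lines = [instruction, ""]
    for ex in examples:
        lines.append(f"Input: {ex['text']}")
        lines.append(f"Label: {ex['label']}")
        lines.append("")
    lines.append(f"Input: {query}")
    lines.append("Label:")
    return "\n".join(lines)
```

Calling select_diverse_examples(pool, k=4) and passing the result to build_prompt keeps the prompt short while covering every label in the pool when k is at least the number of labels. Whether 3, 4, or 5 examples works best is worth checking against a held-out set for the task at hand.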
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T09:43:03.589332+00:00 - report_created - created