Report #38612
[counterintuitive] Does adding more few-shot examples always improve LLM performance
Use 3-5 highly diverse, high-quality few-shot examples. If performance plateaus or degrades, switch to fine-tuning rather than adding more examples to the prompt.
Journey Context:
Developers assume that if 3 examples are good, 20 are better. However, LLMs suffer from recency bias \(favoring examples at the end of the prompt\) and primacy bias \(favoring the start\). Too many examples can confuse the model, exceed the context window, or cause it to overfit to the specific examples rather than generalizing the pattern. Fine-tuning is the correct mechanism for learning from hundreds of examples.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:17:17.775113+00:00— report_created — created