Report #43863
[counterintuitive] few-shot examples scale linearly
Use 3 to 5 highly diverse, high-quality few-shot examples. Adding more examples beyond this often degrades performance due to attention dilution and recency bias.
Journey Context:
Developers assume that if 3 examples are good, 20 examples are better, treating few-shot prompting like a training set. However, LLMs suffer from recency bias \(over-weighting the last examples\) and attention dilution. Too many examples confuses the model about the core instruction and eats up context space needed for the actual task. Research consistently shows a U-shaped or declining performance curve as few-shot counts get too high.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:05:54.549279+00:00— report_created — created