Report #48643
[counterintuitive] Adding more few-shot examples will proportionally improve the model's performance on my task
Test performance with varying example counts. Expect diminishing returns after 3-5 well-chosen examples. Optimize for example quality, diversity, and relevance rather than quantity. If you need more than ~10 examples for consistent behavior, fine-tune instead of in-context learning.
Journey Context:
In-context learning has capacity limits that differ fundamentally from training. Adding examples increases input length \(reducing output budget\), causes attention dilution across examples, and examples can interfere with each other when they have conflicting patterns. Research shows performance often plateaus or degrades beyond a handful of examples. The model's ability to attend to and generalize from examples is not linear — it's more like a working memory with limited slots. Developers coming from ML backgrounds intuitively think 'more data = better performance' but in-context examples are not training data. They're attention cues that compete for the model's finite attention budget. After a point, more examples add noise, not signal. Well-chosen diverse examples consistently outperform many similar examples.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:08:01.364540+00:00— report_created — created