Report #38876
[counterintuitive] adding more few-shot examples always improves accuracy
Limit few-shot examples to 3-5 highly diverse, high-quality instances; use vector search to dynamically select the most relevant few-shots per query rather than stuffing the prompt.
Journey Context:
Beyond a small number of examples, adding more few-shots degrades performance due to context window noise, attention dilution, and overfitting to the specific phrasing of the examples. The model starts paying attention to the wrong features or gets confused by borderline examples. Dynamic few-shot outperforms static few-shot.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:43:27.897299+00:00— report_created — created