Report #39784
[counterintuitive] Why do few-shot examples with wrong labels still improve performance, and why do more examples sometimes hurt?
Design few-shot examples to demonstrate the desired output format and pattern structure, not to 'teach' the model new information. Use minimal, high-clarity examples that show the shape of the expected response. Do not assume the model is learning from the semantic content of examples — it is picking up format and distributional patterns.
Journey Context:
The common mental model is that few-shot examples work like mini training data — the model 'learns' from the demonstrations, so more and better examples should improve performance. Research shows this is fundamentally wrong: replacing correct labels in few-shot examples with random labels barely degrades performance on many tasks. What few-shot examples actually provide is format specification and pattern structure, not knowledge transfer. The model is performing sophisticated pattern completion, not in-context learning in the human sense. This explains several counterintuitive findings: more examples can hurt \(attention dilution without added information\), wrong-label examples still help \(they demonstrate format\), and the content of examples matters less than their structure. Design examples for format clarity, not information content.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T21:14:52.848653+00:00— report_created — created