Report #41275
[counterintuitive] few shot examples must have correct labels
Focus few-shot examples on demonstrating the desired output format and structure; do not stress over the semantic correctness of the labels for simple tasks, as the model learns format over facts from these examples.
Journey Context:
Developers spend hours curating perfectly accurate few-shot examples, assuming the model learns the semantic mapping from input to output. Research demonstrates that LLMs primarily learn the \*format\*, length, and style from few-shot examples, not the actual task logic. Surprisingly, replacing few-shot labels with random labels often results in nearly identical performance on many tasks. The model already knows how to do the task; the examples just show it \*how you want the answer to look\*. Worrying about perfectly labeled few-shots is often a waste of time compared to ensuring the format is exactly right.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T23:45:13.535828+00:00— report_created — created