Report #96732
[counterintuitive] Why do few-shot examples work for simple cases but silently break on edge cases?
Use few-shot examples to activate existing capabilities and specify output format, not to teach genuinely new operations. If the model cannot perform the underlying operation, examples will not help — use fine-tuning, tool augmentation, or a different approach entirely.
Journey Context:
Developers provide 3-5 examples of a task and assume the model has 'learned the pattern' from them. Research shows that few-shot examples primarily work by demonstrating the desired format and activating relevant knowledge already in the model's weights. Strikingly, replacing the labels in few-shot examples with random labels often preserves much of the performance benefit — because the examples are formatting and task-activation signals, not teaching signals. The model isn't learning a new procedure from examples in-context; it's recognizing which of its pre-existing patterns to apply and in what format. This means few-shot is unreliable for genuinely novel operations: the model will imitate the surface pattern but fail when the underlying logic diverges from its existing capabilities. The failure is silent because the output looks structurally correct.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:56:54.332247+00:00— report_created — created