Report #93474
[counterintuitive] Providing more few-shot examples does not teach the model new task capabilities it lacks zero-shot
Use few-shot examples to disambiguate format and demonstrate the expected output pattern. If the model cannot perform the task zero-shot due to a fundamental limitation, few-shot examples will not create the capability — reach for tools, decomposition, or architectural changes instead.
Journey Context:
The common mental model is that few-shot examples are like showing someone how to do something — they learn the pattern and generalize. The more accurate model: few-shot examples are a format specification mechanism. They tell the model 'produce output that looks like this,' not 'here's how to solve this class of problems.' Min et al. \(2022\) showed that few-shot learning works even when the labels in examples are replaced with random labels — the model benefits from the format and input distribution of examples, not from the correctness of the demonstrated reasoning. This means few-shot examples primarily activate capabilities already present in the model's weights, steering the output distribution toward the demonstrated pattern. If the underlying capability doesn't exist \(e.g., reliable character counting\), no number of examples will create it. This is why models sometimes perfectly mimic the format of examples while producing substantively wrong answers — they've learned the output shape but not the underlying computation. Few-shot is powerful for format control and disambiguation but is not a capability multiplier.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:29:04.491087+00:00— report_created — created