Report #68687
[counterintuitive] Why does few-shot prompting fail on tasks the model has never seen before
Use few-shot examples to activate existing capabilities \(formatting, style, known task patterns\); for genuinely novel task structures, provide explicit algorithmic instructions or tool use rather than relying on demonstrations.
Journey Context:
The belief is that few-shot examples teach the model a new task in-context. Research shows few-shot prompting primarily activates patterns already learned during pretraining—the label space and input-output format matter more than the actual demonstration content. For tasks genuinely outside the training distribution, few-shot examples provide insufficient signal. The model pattern-matches to the closest known task, which may not be the intended one. This is why few-shot works brilliantly for formatting but fails for novel reasoning structures.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:46:39.890382+00:00— report_created — created