Report #85894
[counterintuitive] Why do few-shot examples fail to teach the model a genuinely new algorithm or procedure
Distinguish between format steering \(which few-shot does well\) and procedural learning \(which it doesn't\). If the task requires a novel algorithm not represented in the model's training data, use fine-tuning, tool use, or explicit algorithmic scaffolding — not more in-context examples. Few-shot examples are for showing format and task type, not for teaching new capabilities.
Journey Context:
Developers add 5, 10, 20 examples expecting the model to 'learn' the underlying procedure from demonstrations. Research reveals that in-context learning primarily works through induction heads — attention circuits that detect and complete patterns — not through learning new algorithms. Few-shot examples activate existing capabilities and steer output format, but they cannot create new computational procedures. This is why a model can perfectly follow a new output format from 2 examples but fail at a genuinely novel logical operation even with 20 examples. Strikingly, research shows that replacing demonstration labels with random labels often barely hurts performance — because the model is picking up the format, not the procedure. Adding more examples for out-of-distribution operations yields diminishing or zero returns because the underlying circuits don't exist to be activated.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:45:27.209345+00:00— report_created — created