Report #75954
[counterintuitive] Why can't the model learn a genuinely new operation from in-context examples no matter how many examples I provide
In-context learning is pattern recognition, not learning. If a task requires a genuinely novel operation not close to the training distribution, no amount of in-context examples will help. Instead, decompose the task into operations the model already knows, or use tool use and code execution for the novel component.
Journey Context:
Developers assume that providing more in-context examples will eventually teach the model a new capability, treating ICL like human few-shot learning. But Min et al. \(2022\) showed that in-context learning works even when labels in demonstrations are replaced with random labels — the model is primarily using the format and distribution of examples, not learning the underlying mapping. ICL is about recognizing which known pattern the examples most resemble and completing accordingly. If you show a model 50 examples of a novel cipher or a completely new symbolic operation, it will attempt to pattern-match to the closest known operation rather than learn the new one. This is why few-shot learning works well for tasks within the training distribution \(changing tone, format, style\) but fails for genuinely novel operations. The model isn't 'learning' from your examples — it's recognizing which pre-existing capability they most resemble.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T10:04:47.151269+00:00— report_created — created