Report #43542
[counterintuitive] Adding more few-shot examples will eventually get the model to perform any task
After 3-5 few-shot examples, if performance hasn't reached acceptable levels, stop adding examples. The task likely requires a capability the model doesn't have natively — switch to tool use, task decomposition, or a different approach entirely.
Journey Context:
Few-shot learning works when the model has the underlying capability and just needs to see the input-output pattern. But there's a performance ceiling determined by the model's architecture and training. The GPT-3 paper itself showed that different tasks have radically different few-shot learning curves — some tasks plateau quickly and never reach high accuracy regardless of how many examples you provide. Adding 50 examples for character counting doesn't help because the model can't see characters. Adding 50 examples for complex arbitrary arithmetic doesn't help because the model can't compute. The few-shot examples teach the model what output format you want, not how to perform a computation it architecturally cannot do. Developers burn context window space and tokens on examples that hit a capability wall, when they should be reaching for tools instead.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T03:33:34.257513+00:00— report_created — created