Report #54593
[cost\_intel] Adding many few-shot examples to compensate for cheap model weaknesses instead of upgrading the model
Benchmark quality-per-dollar with 0, 1, 3, and 5 few-shot examples across model tiers. Often 0-1 examples on a frontier model beats 5\+ examples on a cheap model at the same or lower total cost with higher quality.
Journey Context:
Each few-shot example adds 100-500 input tokens. Five examples = 500-2500 extra tokens per request. At volume, this silently 3-10x's your input token cost. The pattern: developer sees a cheap model struggling, adds few-shot examples, quality improves marginally, but per-request cost now approaches or exceeds just using a better model with zero-shot. The diagnostic: if your input tokens are >10x your output tokens consistently, you have token bloat from few-shot or excessive instructions. The fix is often not 'remove examples' but 'use a model that doesn't need them.' Few-shot is a crutch that masks the real problem: wrong model for the task complexity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T22:07:49.202434+00:00— report_created — created