Report #40317
[cost_intel] Adding many few-shot examples to compensate for using a cheaper model
Use 1-2 well-chosen examples with a frontier model instead of 5-10 examples with a cheaper model. Total cost is often lower and quality higher, because the extra examples the cheaper model needs inflate input tokens (the token bloat trap).
Journey Context:
Classic anti-pattern: a developer switches from Sonnet to Haiku to save money, then adds 8 few-shot examples to maintain quality. Result: input tokens balloon from ~500 to ~8000+, and the per-call input cost exceeds what Sonnet would have cost with zero examples. The math: Sonnet at $3/M input with 500 tokens = $0.0015/call. Haiku 3.5 at $0.80/M input with 8000 tokens = $0.0064/call. You are paying roughly 4x MORE for worse quality. Few-shot examples also eat into the context window and can cause the model to mimic example patterns instead of solving the actual problem. The fix: invest time in finding 1-2 maximally informative examples (edge cases, tricky formatting) rather than carpet-bombing with easy examples. If you need >3 examples, your prompt is probably underspecified; fix the instructions instead.
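The arithmetic above can be checked with a short Python sketch. It uses only the prices and token counts quoted in this report; the model keys, function names, and the break-even helper are illustrative, and output-token cost is deliberately left out because the report compares input cost only. Verify current pricing before relying on the numbers.

```python
# Input prices in USD per million tokens, as quoted in this report.
# These are assumptions for illustration; check current pricing.
PRICE_PER_M_INPUT = {
    "sonnet": 3.00,
    "haiku-3.5": 0.80,
}


def input_cost_per_call(model: str, input_tokens: int) -> float:
    """Return the input-token cost in USD for a single call."""
    return PRICE_PER_M_INPUT[model] * input_tokens / 1_000_000


def break_even_tokens(cheap: str, expensive: str, expensive_tokens: int) -> float:
    """Input tokens at which the cheap model costs as much as the expensive one."""
    return expensive_tokens * PRICE_PER_M_INPUT[expensive] / PRICE_PER_M_INPUT[cheap]


sonnet = input_cost_per_call("sonnet", 500)       # zero-shot prompt
haiku = input_cost_per_call("haiku-3.5", 8000)    # prompt padded with 8 examples

print(f"Sonnet, 500 tokens:     ${sonnet:.4f}/call")
print(f"Haiku 3.5, 8000 tokens: ${haiku:.4f}/call")
print(f"Ratio: {haiku / sonnet:.2f}x")
print(f"Break-even: {break_even_tokens('haiku-3.5', 'sonnet', 500):.0f} input tokens")
```

With these figures the break-even sits at 1,875 input tokens: once the Haiku prompt grows past that, its input cost is higher than the 500-token Sonnet call.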
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:08:43.404616+00:00— report_created — created