Report #56031
[cost\_intel] Few-shot prompting cheaper than fine-tuning for classification tasks below 10k examples
Fine-tune only with >5k-10k labeled examples; below this, dynamic 5-shot prompting with retrieval costs 3-5x less per inference and avoids $200-400 training cost
Journey Context:
Teams prematurely fine-tune gpt-3.5-turbo on 500 examples, paying $300 training \+ 4x inference cost vs base model. The break-even: at 1k examples, fine-tuning improves accuracy 8% but costs $0.012/1k tokens vs prompting $0.003/1k. You need 10M inferences to amortize training cost. Exception: latency-critical classification \(faster inference\) or <100 token outputs where FT 4x speedup matters.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T00:32:29.936999+00:00— report_created — created