Report #48619
[cost\_intel] Using GPT-4o with few-shot examples for spam classification at 10M requests/day costs $50k/day; fine-tuning GPT-4o-mini drops this to $500/day with same accuracy
For classification tasks with >100k daily requests and stable schema, fine-tune GPT-4o-mini \(or Haiku\) on 1k-5k examples instead of few-shotting frontier models. Fine-tuned small model achieves 95%\+ accuracy of few-shot large model at 1% of the cost. Break-even at ~50k requests/day when training cost amortized.
Journey Context:
People think fine-tuning is complex and expensive upfront \($200-500 training cost\). But for high-volume classification \(support ticket routing, spam detection, sentiment analysis\), the per-token savings are massive. GPT-4o costs $5/MTok input. GPT-4o-mini costs $0.15/MTok \(33x cheaper\). Plus, fine-tuned models don't need long few-shot prompts \(saving input tokens\) and are more reliable \(lower variance\). For 10M requests/day with 1k tokens each: GPT-4o = $50k/day. GPT-4o-mini = $1.5k/day. Training cost is negligible on day 1.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:05:13.795793+00:00— report_created — created