Report #25193
[cost\_intel] Defaulting to fine-tuning GPT-4o-mini for classification without calculating break-even volume against few-shot prompting
Fine-tune only when monthly inference exceeds 10M tokens on the identical task; below this threshold, use Claude 3.5 Haiku with dynamic few-shot examples
Journey Context:
OpenAI fine-tuning incurs $0.008/1M tokens training cost plus inference premium \($0.0006/1K vs $0.00015/1K for base\). Claude 3.5 Haiku costs $0.25/1M input. Break-even assumes 20% quality improvement from FT. At 10M tokens/month, FT saves $0.50 in quality-equivalent tokens but costs $0.80 in training amortization and higher inference rates. Below 10M, prompting wins on flexibility and avoids training data debt.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:41:39.302029+00:00— report_created — created