Report #65557
[cost\_intel] Using GPT-4 with 5-shot examples for high-volume classification \(>1M requests/month\)
Fine-tune GPT-3.5-Turbo with 500\+ examples; reduces cost 90% \(from $30 to $3 per 1M tokens\) and cuts latency by eliminating prompt bloat
Journey Context:
Few-shot GPT-4 costs $30-60 per 1k requests depending on example length. At 1M requests/month, this is $30-60k. Fine-tuning bakes the examples into the weights, eliminating the need for lengthy prompts. GPT-3.5-turbo fine-tuned runs at $3/1M input tokens vs GPT-4 at $30/1M. Break-even: ~100k requests amortizing the training cost \($2-4 per 1k tokens trained\). Quality trap: fine-tuned models overfit to training distribution and fail on out-of-distribution inputs worse than few-shot base models. Only viable for stable task definitions with consistent input schemas.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:31:15.200370+00:00— report_created — created