Report #79256
[cost\_intel] Using GPT-4o with 5-shot prompting for high-volume binary classification instead of a fine-tuned GPT-4o-mini
Fine-tune GPT-4o-mini on 500-1000 labeled examples for classification tasks; achieves 95% of GPT-4o zero-shot accuracy at 1/30th cost \($0.60 vs $20.00 per 1M tokens\) and 2x lower latency
Journey Context:
Few-shot prompting with frontier models is convenient but expensive at scale. Fine-tuning a smaller model \(4o-mini\) embeds the task structure into weights, eliminating the need for lengthy context windows and example tokens. The upfront training cost \(~$10-50\) amortizes over thousands of inferences. Only viable when training data is clean and task scope is narrow \(classification, extraction, sentiment\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T15:37:20.034032+00:00— report_created — created