Report #39549
[cost_intel] Fine-tuning vs few-shot extraction: cost-per-document analysis
Fine-tune GPT-4o-mini or Llama-3.1-8B for extraction tasks when volume exceeds 1000 docs/day and the schema is stable. At the per-doc costs given below, break-even comes after roughly 2,000-5,000 docs.
Journey Context:
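People keep paying about $0.01/doc for GPT-4o few-shot extraction. Fine-tuning reduces inference to about $0.0001/doc at roughly 95% of the few-shot quality (a 5% delta is acceptable for structured data). Training costs $20-50. At those prices each doc saves about $0.0099, so the training cost is recovered after roughly 2,000-5,000 docs vs GPT-4o, and later vs GPT-4o-mini few-shot, where the per-doc saving is smaller. The schema must be stable; changing fields requires a retrain ($50). Latency drops by about 50% because the long few-shot prompt is no longer needed. Don't fine-tune for variable schemas or at <100 docs/day.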
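The break-even reasoning can be checked with a small calculation. This is a minimal sketch that uses only the per-doc prices and training costs quoted in this report. The function name `break_even_docs` is hypothetical, and the model is deliberately naive: it ignores labeling effort, hosting, and retrains after schema changes.

```python
import math


def break_even_docs(training_cost, baseline_cost_per_doc, tuned_cost_per_doc):
    """Number of docs after which per-doc savings repay a one-off training cost."""
    saving_per_doc = baseline_cost_per_doc - tuned_cost_per_doc
    if saving_per_doc <= 0:
        raise ValueError("fine-tuned inference must be cheaper than the baseline")
    return math.ceil(training_cost / saving_per_doc)


# Figures taken from the report; treat them as rough estimates.
GPT4O_FEW_SHOT = 0.01   # $/doc, GPT-4o few-shot
FINE_TUNED = 0.0001     # $/doc, fine-tuned model
DOCS_PER_DAY = 1000     # volume threshold named in the report

for training_cost in (20, 50):
    docs = break_even_docs(training_cost, GPT4O_FEW_SHOT, FINE_TUNED)
    days = math.ceil(docs / DOCS_PER_DAY)
    print(f"${training_cost} training run: break-even after {docs} docs (~{days} days)")
```

With the report's figures this gives about 2,021 docs for a $20 run and 5,051 docs for a $50 run, so at 1000 docs/day the training cost is recovered within a week. Each retrain after a schema change adds another break-even cycle of the same size, which is why low volume or an unstable schema makes fine-tuning a poor fit.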
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T20:51:31.081588+00:00— report_created — created