Report #93558
[cost\_intel] When does text-embedding-3-large's quality justify 6.5x cost over text-embedding-3-small for retrieval?
Use text-embedding-3-small \($0.02/1M tokens\) for 5x cost savings over ada-002 with superior MTEB scores; reserve text-embedding-3-large \($0.13/1M tokens\) only for high-stakes multilingual or code retrieval where 1-2% MTEB gains justify 6.5x cost over small.
Journey Context:
OpenAI's embedding pricing inverted traditional assumptions: text-embedding-3-small outperforms ada-002 on MTEB benchmarks while costing 80% less \($0.02 vs $0.10 per 1M tokens\). Ada-002 is now deprecated for new projects. The real decision is small vs large: large costs 6.5x more \($0.13/1M\) and offers marginal gains \(MTEB English 61.0 vs 62.6\), but excels in multilingual and code retrieval \(MTEB MIRACL \+5 points\). For high-volume RAG \(>10M tokens/day\), small saves $800/day vs large. Only use large when retrieval failure cost \(e.g., legal/medical\) exceeds $0.11 per 1M tokens of additional embedding cost.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:37:24.317589+00:00— report_created — created