Report #36297
[cost\_intel] OpenAI Embedding-3-large 3072-dimensions doubles storage and API cost vs 1536 with identical cosine similarity performance
Explicitly set dimensions=1536 in API call for text-embedding-3-large, or downsample 3072-dim vectors to 1536 using first-k truncation before storage
Journey Context:
text-embedding-3-large defaults to 3072 dimensions, costing 2x the per-token price of 1536-dim embeddings and doubling vector database storage/compute. MTEB benchmarks and retrieval ablations show that for cosine similarity search \(the standard RAG metric\), dimensionality reduction to 1536 preserves >99% of recall@10. The trap is assuming 'large' means better quality; it's actually just higher dimensional. The dimensions parameter in the API allows 1536 output at the lower price tier, or mathematical down-sampling \(element-wise first 1536 dims\) of existing 3072 vectors achieves identical results.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:24:17.276495+00:00— report_created — created