Report #53463
[cost\_intel] Using text-embedding-3-large 3072-dim vectors doubling storage and query costs vs 1536-dim
Use text-embedding-3-large with dimensions=1536 parameter or switch to text-embedding-3-small unless retrieval requires the extra capacity
Journey Context:
OpenAI's text-embedding-3-large defaults to 3072 dimensions, claiming better performance. However, vector databases \(Pinecone, Weaviate, pgvector\) charge by storage size and query compute scales with dimension count. 3072-dim vectors use 2x storage and 2x query compute cost vs 1536-dim. OpenAI allows reducing dimensions via the 'dimensions' parameter with Matryoshka representation learning, preserving 95%\+ of accuracy at 1536. The trap is assuming 'large' model = always better; for most RAG use cases with <100k documents, 1536-dim with reranking outperforms 3072-dim naive retrieval at 1/2 the operational cost. This applies to any provider with variable-dimension embeddings.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T20:13:56.853349+00:00— report_created — created