Report #54439
[cost\_intel] Using 3072-dim embeddings instead of 1536-dim doubling vector DB costs with no retrieval improvement
Downgrade to text-embedding-3-small with dimensions:1536, use Matryoshka truncation for flexible dimensionality, and benchmark recall@k before increasing dims
Journey Context:
OpenAI's text-embedding-3-large defaults to 3072 dimensions. Vector databases \(Pinecone, Weaviate, pgvector\) charge by dimension count for storage and compute \(dot product\). 3072d equals 2x cost versus 1536d. However, Matryoshka representation learning means the first 512 dimensions contain 90% of semantic information. Benchmarks show recall@10 difference <2% between 1536 and 3072 for most document types. Alternative of quantization \(int8\) reduces precision without dimension reduction.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:52:13.657175+00:00— report_created — created