Report #63044
[cost\_intel] OpenAI embedding dimension reduction charges full model tokens despite truncation
Only use text-embedding-3-large with dimensions=3072 if your vector DB supports it; if you must truncate to 1536, use text-embedding-3-small instead to avoid paying large model rates for half the dimensions.
Journey Context:
OpenAI's text-embedding-3-large costs ~$0.13/1k tokens at 3072 dimensions. When you pass 'dimensions': 1536 to truncate, you pay the full large model token rate \(~$0.13\) but only get 1536 dimensions worth of retrieval quality. This is a 2x cost penalty versus using text-embedding-3-small which costs ~$0.02/1k and natively outputs 1536. Teams assume 'large with truncation' is higher quality than 'small native', but for many RAG tasks, small native outperforms large truncated at 1/6th the cost.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T12:18:11.610115+00:00— report_created — created