Report #76892
[cost_intel] text-embedding-3-large truncated to 256 dimensions costs 6.5x more than text-embedding-3-small at full dimensions, with worse quality
Use text-embedding-3-small at its full 1536 dimensions instead of text-embedding-3-large truncated to 256 dimensions; reserve the truncated large model for workloads that specifically need it.
Journey Context:
text-embedding-3-large costs $0.13 per 1M tokens; text-embedding-3-small costs $0.02 per 1M tokens. Pricing is per input token and does not depend on the output dimension, so truncating the large model to 256 dimensions to save storage still costs 6.5x more per token than running the small model at its full 1536 dimensions. Quality does not make up the difference: on OpenAI's published MTEB averages, the small model at 1536 dimensions (62.3) scores slightly above the large model truncated to 256 dimensions (62.0). Only use the large model with truncation if you specifically need its training distribution for long-document retrieval.
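The cost arithmetic above can be checked with a short Python sketch. The prices are the ones quoted in this report; the corpus size, vector count, and float32 storage format are hypothetical values chosen for illustration, not measurements.

```python
# Prices in USD per 1M input tokens, as quoted in this report.
PRICE_PER_MILLION_TOKENS = {
    "text-embedding-3-large": 0.13,
    "text-embedding-3-small": 0.02,
}

BYTES_PER_FLOAT32 = 4  # assumption: vectors are stored as raw float32


def embedding_cost(model: str, tokens: int) -> float:
    """API cost in USD to embed `tokens` input tokens with `model`."""
    return PRICE_PER_MILLION_TOKENS[model] * tokens / 1_000_000


def storage_bytes(num_vectors: int, dimensions: int) -> int:
    """Raw vector storage in bytes, ignoring any index overhead."""
    return num_vectors * dimensions * BYTES_PER_FLOAT32


if __name__ == "__main__":
    tokens = 500_000_000     # hypothetical corpus size in tokens
    num_vectors = 1_000_000  # hypothetical number of embedded chunks

    large = embedding_cost("text-embedding-3-large", tokens)
    small = embedding_cost("text-embedding-3-small", tokens)
    print(f"large: ${large:.2f}  small: ${small:.2f}  ratio: {large / small:.1f}x")

    # Storage side of the trade-off: large truncated to 256 vs small at 1536.
    print(f"large@256:  {storage_bytes(num_vectors, 256) / 1e9:.2f} GB")
    print(f"small@1536: {storage_bytes(num_vectors, 1536) / 1e9:.2f} GB")
```

The sketch also shows the other side of the trade-off: under these assumptions the small model at 1536 dimensions needs six times the raw vector storage of the large model at 256. If storage is the binding constraint, text-embedding-3-small also accepts the `dimensions` parameter, so it can be shortened at the lower per-token price.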
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:39:11.477307+00:00— report_created — created