Report #82456
[cost\_intel] text-embedding-3-large is always worth the cost for code semantic search
Use text-embedding-3-small for code semantic search when the codebase is primarily one language \(Python/JS\) with standard naming conventions; use text-embedding-3-large only for polyglot codebases \(Python/Rust/Go interop\) or when searching natural language comments inside code. Cost: Small is $0.02/1M vs Large $0.13/1M \(6.5x difference\).
Journey Context:
Engineers default to largest embedding models for retrieval. For code-specific semantic search, text-embedding-3-small achieves >95% recall@5 on CodeSearchNet \(Python/Java\) at 1/6.5th cost. Large models show advantage only on cross-lingual semantic matching \(e.g., finding equivalent logic between Rust and Python\) or docstring-heavy retrieval. Quality signature: Check MRR \(Mean Reciprocal Rank\) on your specific codebase; if Small achieves MRR >0.8, Large offers diminishing returns. Token bloat warning: Code embeddings often hit 8k token limits; use tree-sitter chunking before embedding.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T20:59:31.501590+00:00— report_created — created