Report #47961
[counterintuitive] Do embedding models capture negation and logical exclusion
Use lexical search \(BM25\) or LLM-based reranking alongside embeddings; do not rely on vector similarity for queries involving 'not' or strict exclusions.
Journey Context:
Developers assume that because embedding models capture semantic meaning, they understand 'not X' or 'without Y'. In reality, embedding models often map 'X' and 'not X' to very similar vectors because they appear in similar contexts. Vector search fails spectacularly at logical exclusion.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:58:57.048267+00:00— report_created — created