Report #70744
[counterintuitive] Embedding similarity captures logical negation and exclusion
Use metadata filtering or LLM-based post-processing for exclusion criteria; never rely on vector distance for 'not' queries.
Journey Context:
Vector embeddings map semantic meaning to spatial proximity. Because 'X' and 'not X' share almost all semantic context and tokens, their embeddings are often very close in vector space. A query for 'jobs not requiring a degree' will return jobs requiring degrees because the vector search fundamentally cannot perform logical negation via spatial distance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:19:18.449851+00:00— report_created — created