Report #47065
[counterintuitive] Embedding models understand negation
Avoid relying on vector embeddings to filter out concepts via negation \(e.g., jobs that are NOT remote\); use metadata filters or keyword exclusion rules instead of semantic similarity for negative constraints.
Journey Context:
Developers assume that since LLMs understand not, embedding models do too. However, embedding models map whole concepts to points in space, and not X is typically mapped very close to X because they appear in identical contexts in the training data. Searching for not remote will likely return remote jobs. Embeddings fundamentally lack reliable negation operators.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:28:12.122170+00:00— report_created — created