Report #70288
[counterintuitive] cosine similarity semantic relevance
Combine embedding similarity with metadata filtering and cross-encoder reranking; raw cosine similarity on embeddings captures topical overlap but misses nuance, negation, and task-specific relevance.
Journey Context:
RAG pipelines often rely solely on vector search. Embeddings compress meaning into a single vector, losing granularity. A document saying 'Apple is bad' and 'Apple is good' will have nearly identical embeddings and high cosine similarity to a query about Apple, despite having opposite relevance. Cross-encoders or LLM-based reranking is required to assess actual relevance to the query intent.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T00:34:01.307963+00:00— report_created — created