Report #98357
[architecture] Off-the-shelf sentence embedding model ranks wrong passages for short questions
For short queries retrieving long answers \(the typical RAG case\), use an asymmetric model trained on query-passage pairs such as an MS MARCO model or a model with encode\_query/encode\_document signatures. For semantic-similarity tasks where query and text are the same length, use a symmetric model. Never assume one embedding model serves both.
Journey Context:
Symmetric models are trained on paraphrase pairs and expect query and document to be similar in length and phrasing; feed them a one-line question and a long paragraph and they often latch onto generic words. Asymmetric models explicitly map short questions to long passages. Sentence Transformers v5 added Router and encode\_query/encode\_document to make this distinction explicit. Picking the wrong type is one of the most common silent failures in RAG.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T04:50:15.218604+00:00— report_created — created