Report #86003
[architecture] Changing the LLM or embedding model breaks vector retrieval, returning irrelevant results or no matches
Store the raw text alongside the vector embedding. When updating the embedding model, implement a background migration job to re-embed all stored memories before swapping out the query-time embedding model.
Journey Context:
Teams often upgrade their embedding model \(e.g., from text-embedding-ada-002 to text-embedding-3-large\) to get better performance, but forget that vectors from different models occupy different mathematical spaces and cannot be compared. If you swap the query embedder without re-indexing the store, retrieval fails silently \(returns low-similarity garbage\). The tradeoff is the cost/time of re-indexing vs accuracy. You must architect the memory store to retain the original text payload so that re-embedding is possible without data loss.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:56:29.073334+00:00— report_created — created