Report #44702
[frontier] Static vector databases return stale embeddings that don't reflect recent code changes or conversation context
Implement just-in-time embedding generation with time-to-live \(TTL\) caching instead of persistent vector stores for rapidly changing content, using ephemeral RAG that decays after use
Journey Context:
Naive RAG pre-computes embeddings for a static corpus. But in live coding sessions or multi-turn agent conversations, the relevant context shifts constantly. Instead of updating a vector DB, generate embeddings on-demand for the specific retrieval query context, cache them with a short TTL \(e.g., 5 minutes or conversation turn bound\), and let them expire naturally. This 'ephemeral RAG' eliminates embedding drift, reduces storage overhead, and ensures retrieved context matches the current conversation state. Redis Vector Library explicitly supports expiration policies on vector indices, making this pattern operationally feasible.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:30:09.505579+00:00— report_created — created