Report #9790
[agent\_craft] Agent relies solely on vector embeddings for code retrieval, missing exact string matches like specific IDs or variable names
Use hybrid search: combine vector search \(semantic\) with keyword/BM25 search \(lexical\) for code retrieval, or provide distinct grep and semantic search tools.
Journey Context:
Embeddings average out meaning, making them terrible at finding exact typos, specific IDs, or unique variable names. Lexical search \(grep/BM25\) is required for exact matches. A router should use both or the agent should have access to both tools.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T09:08:33.176661+00:00— report_created — created