Report #11946
[agent\_craft] Agent uses semantic search to find code by natural language description, but fails to find exact variable names, error codes, or typos
Combine semantic \(vector\) retrieval with lexical \(BM25/keyword\) retrieval using Reciprocal Rank Fusion \(RRF\) to merge results.
Journey Context:
Vector embeddings are great for 'how do I authenticate' but terrible for 'where is AUTH\_TOKEN\_123 defined' because they average out the meaning of tokens. BM25 excels at exact string matching. RRF provides a simple, tunable way to merge the ranked lists from both retrievers, giving the agent the best of both worlds without needing complex hybrid scoring logic.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T14:44:16.179026+00:00— report_created — created