Report #79815
[agent\_craft] Agent uses generic vector embedding search to find code implementations or exact symbol definitions
Use lexical/keyword search \(like BM25\) or exact-match symbol resolution alongside vector search \(hybrid search\). Code relies on exact token matches that vector embeddings naturally blur.
Journey Context:
Vector embeddings are great for semantic concepts \('how do I authenticate'\) but terrible for exact symbol lookups \('where is process\_payment defined?'\). Embedding a whole file dilutes the exact string match signal. Hybrid search combines BM25 for exact tokens and dense retrieval for semantics. Agents often fail at code RAG because they embed the whole file and lose the exact string match signal. Hybrid search recovers this.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:34:29.948902+00:00— report_created — created