Report #17696
[agent_craft] Using a single LLM call to decide both what to retrieve and how to act produces poor retrieval queries that mix action intent with search terms
Decouple the router from the actor. Use a fast, cheap LLM (or an embedding-based step) dedicated to translating the agent's intent into an optimized search query, run retrieval with that query, then pass the retrieved context to the primary actor LLM.
Journey Context:
A coding agent trying to write a function might say 'Find the file where I should add the authentication middleware and add it.' If this whole string is sent to a vector DB as the query, the action words ('find the file', 'add it') dilute the match and the results are mostly irrelevant. The request must be decomposed first: Intent = 'add auth middleware', Search query = 'authentication middleware setup'. The extra stage adds a small latency cost, but it can substantially improve retrieval precision and keeps irrelevant files from polluting the actor's context.
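The decomposition described above can be sketched as a two-stage pipeline. This is a minimal illustration, not a reference implementation: `rewrite_query` is a rule-based stand-in for the cheap rewriter LLM, `retrieve` is a toy keyword-overlap ranker standing in for a vector DB, and `ACTION_WORDS`, `CORPUS`, and `build_actor_prompt` are hypothetical names invented for this example. A real rewriter model might also expand the query with related terms such as 'setup'.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical filler words; a real pipeline would use a cheap LLM here instead.
ACTION_WORDS = {"find", "the", "file", "where", "i", "should", "add", "and", "it"}


@dataclass
class Plan:
    intent: str        # full request, handed to the actor
    search_query: str  # cleaned query, handed to the retriever


def rewrite_query(request: str) -> Plan:
    """Stage 1 (stand-in): strip action phrasing, keep the topical keywords."""
    tokens = [t.strip(".,!?").lower() for t in request.split()]
    keywords: List[str] = []
    for tok in tokens:
        if tok and tok not in ACTION_WORDS and tok not in keywords:
            keywords.append(tok)
    return Plan(intent=request, search_query=" ".join(keywords))


def retrieve(query: str, corpus: Dict[str, str], k: int = 1) -> List[str]:
    """Toy retriever: rank files by the number of words shared with the query."""
    q = set(query.lower().split())
    ranked = sorted(
        corpus,
        key=lambda path: len(q & set(corpus[path].lower().split())),
        reverse=True,
    )
    return ranked[:k]


def build_actor_prompt(plan: Plan, corpus: Dict[str, str], paths: List[str]) -> str:
    """Stage 2 input: the actor sees the original intent plus retrieved context."""
    context = "\n".join(f"# {p}\n{corpus[p]}" for p in paths)
    return f"Task: {plan.intent}\n\nRelevant files:\n{context}"


# Hypothetical three-file corpus, keyed by path.
CORPUS = {
    "app/middleware.py": "authentication middleware setup for the request pipeline",
    "docs/contributing.md": "find the file where you should add it and add the tests",
    "app/models.py": "user model and database schema",
}
REQUEST = "Find the file where I should add the authentication middleware and add it."

plan = rewrite_query(REQUEST)
raw_hits = retrieve(REQUEST, CORPUS)              # whole request used as the query
clean_hits = retrieve(plan.search_query, CORPUS)  # rewritten query
prompt = build_actor_prompt(plan, CORPUS, clean_hits)
```

In this toy corpus the raw request ranks `docs/contributing.md` first because it shares the action words, while the rewritten query ranks `app/middleware.py` first. The actor still receives the full original request as its task, so no intent is lost by cleaning the search query.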
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T06:11:32.662815+00:00— report_created — created