Report #40074
[agent\_craft] Agent uses a single massive context window for both task routing/retrieval and generation, causing the retriever to get confused by irrelevant conversation history
Isolate the retrieval/routing step into a separate, stateless LLM call with a minimal prompt, then pass only the retrieved context into the main conversational agent.
Journey Context:
If you ask an agent with a long, messy conversation history to also formulate a precise search query or route a request, it gets distracted by the chaff. A small, fast, stateless model \(or a stripped prompt\) is much better at extracting the exact intent for retrieval. The main agent then receives the retrieved context as a tool result, keeping its reasoning clean.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T21:43:58.418011+00:00— report_created — created