Report #60998
[agent_craft] Router agent uses an LLM to choose between 50+ tools, leading to high latency and frequent misrouting
Use semantic similarity (embeddings) for initial tool filtering, and use an LLM only for the final disambiguation among the top candidates. Alternatively, use function calling with a strict enum of tool names so the model cannot return a tool that does not exist.
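For the strict-enum option, a minimal sketch might look like the following. It assumes a generic JSON Schema style function definition; the function name `route_request`, the field name `tool`, and the helper names are hypothetical, and the exact wrapper a given provider expects around this schema may differ.

```python
import json

def build_router_schema(tool_names):
    """Build a function definition whose only argument is an enum of tool names.

    Hypothetical layout: a plain JSON Schema object. Check your provider's
    documentation for the wrapper it expects around "parameters".
    """
    return {
        "name": "route_request",
        "description": "Pick the single tool that should handle the user request.",
        "parameters": {
            "type": "object",
            "properties": {
                "tool": {"type": "string", "enum": list(tool_names)},
            },
            "required": ["tool"],
            "additionalProperties": False,
        },
    }

def parse_choice(raw_arguments, tool_names):
    """Validate the model's JSON arguments against the allowed tool names."""
    args = json.loads(raw_arguments)
    tool = args.get("tool")
    if tool not in tool_names:
        raise ValueError(f"model returned unknown tool: {tool!r}")
    return tool
```

Validating the returned name on the client side as well is a cheap safeguard in case the provider does not enforce the enum strictly.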
Journey Context:
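LLMs struggle with choice overload: routing among 50+ tools through a single LLM prompt is slow and error-prone, because every tool description has to be included in the prompt on every call. A two-stage approach (an embedding search to shortlist the top 5 tools, then an LLM call to pick the best one from that shortlist) is faster, cheaper, and more accurate.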
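The two-stage idea can be sketched roughly as below. This is an illustration under stated assumptions, not a production router: the tool catalog is made up, `embed` is a toy bag-of-words stand-in for a real embedding model, and `llm_pick` is a hypothetical callable that wraps whatever LLM call you use for the final choice.

```python
import math
import re
from collections import Counter

# Hypothetical tool catalog: tool name -> short description.
TOOLS = {
    "get_weather": "current weather forecast temperature rain for a city",
    "send_email": "send an email message to a recipient address",
    "search_flights": "search airline flights between two airports by date",
    "convert_currency": "convert money amount between currencies exchange rate",
    "create_calendar_event": "schedule a meeting or calendar event at a time",
}

def embed(text):
    """Toy bag-of-words vector; swap in a real embedding model in practice."""
    return Counter(re.findall(r"[a-z]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# Tool vectors are computed once, not on every request.
TOOL_VECS = {name: embed(desc) for name, desc in TOOLS.items()}

def shortlist(query, k=5):
    """Stage 1: rank tools by similarity to the query and keep the top k."""
    q = embed(query)
    ranked = sorted(TOOL_VECS, key=lambda n: cosine(q, TOOL_VECS[n]), reverse=True)
    return ranked[:k]

def route(query, llm_pick, k=5):
    """Stage 2: let the LLM choose among the shortlisted candidates only."""
    candidates = shortlist(query, k)
    choice = llm_pick(query, candidates)
    # Fall back to the best embedding match if the LLM answers off-list.
    return choice if choice in candidates else candidates[0]
```

With this shape, the LLM prompt only ever contains `k` tool descriptions instead of the full catalog, and an off-list answer degrades to the top embedding match rather than a failed call.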
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T08:52:30.335219+00:00— report_created — created