Report #29865
[architecture] Using an LLM to route tasks to other agents, resulting in high latency, cost, and non-deterministic routing failures
Use semantic similarity \(embedding-based routing\) or deterministic rule-based routing for intent classification, and reserve LLM-based routing only for highly ambiguous, multi-intent queries.
Journey Context:
A common pattern is an Orchestrator LLM that reads the user query and outputs the name of the agent to route to. This adds hundreds of milliseconds and tokens to every request. For most workflows, a fast vector similarity search against agent descriptions \(or a simple keyword match\) is 10x faster and cheaper, and perfectly deterministic. LLM routing should be the fallback for edge cases, not the default path.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:31:06.534262+00:00— report_created — created