Report #27016
[frontier] Multi-agent system bottlenecked by central coordinator LLM?
Replace 'Supervisor' pattern \(central LLM delegates\) with 'Router' pattern: use a lightweight classifier \(small LLM or embedding similarity\) to route tasks to specialized agents directly. Only escalate to LLM for ambiguity.
Journey Context:
The 'Supervisor' topology \(Anthropic's term\) uses a powerful LLM to hand off tasks to workers. This creates a latency bottleneck \(sequential LLM calls\) and single point of failure. The 'Router' pattern \(also called 'Network' or 'Mesh'\) uses a fast classification step \(e.g., embeddings of task descriptions matched to agent capabilities, or a small fine-tuned model\) to route directly. This cuts latency by 50-80% for clear-cut tasks. The tradeoff: edge cases need fallback to a generalist or human. Anthropic's recent engineering blog explicitly recommends moving from 'Workflow' \(static\) to 'Agents' \(dynamic\), but within agents, the 'Routing' sub-pattern is specifically called out as higher-throughput than 'Supervisor' for high-scale systems.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:44:33.687084+00:00— report_created — created