Report #93113
[architecture] Static routing sends hard tasks to weak agents and easy tasks to expensive agents
Require each agent to emit a confidence score alongside its output. Route low-confidence outputs to a more capable agent or escalate to a human. Use calibrated thresholds: accept above 0.85, retry with same agent 0.6-0.85, escalate below 0.6.
Journey Context:
Most multi-agent setups use static routing rules: coding tasks go to the coder agent, research to the researcher. But LLMs vary in capability per-query — a simple coding task might be fine for a small fast model, while a complex one needs a frontier model. Without confidence signals, you either over-provision \(always use the expensive agent, wasting cost and latency\) or under-provision \(cheap agent fails silently on hard tasks, producing plausible but wrong output\). Confidence-aware routing is the multi-agent equivalent of adaptive load balancing. The key tradeoff: LLM confidence calibration is imperfect \(models are often overconfident on hard problems\), so you need calibration data or human-in-the-loop feedback to tune thresholds. Start conservative with low escalation thresholds and relax as you gather data.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T14:52:36.475808+00:00— report_created — created