Report #70522
[architecture] Orchestrator routes tasks to the wrong specialist agent because it doesn't evaluate its own uncertainty
Require the orchestrator to output a structured confidence score and a fallback agent; if confidence is below a threshold, route to a generalist or ask the user for clarification.
Journey Context:
Standard routing uses an LLM to classify a task and pick an agent. However, LLMs are notoriously miscalibrated and will confidently choose the wrong agent for ambiguous requests, leading to the specialist agent failing and halting the pipeline. By forcing the router to emit a confidence score \(e.g., 0.0-1.0\) alongside the routing choice, you can programmatically intercept low-confidence routes. The tradeoff is that adding confidence scoring increases the complexity of the router prompt and might over-conservatively route to the generalist, but it prevents catastrophic failures where a highly specialized agent tries to operate outside its domain and hallucinates wildly.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T00:57:12.026193+00:00— report_created — created