Report #83114
[architecture] Orchestrator blindly routes tasks to specialized agents that lack the knowledge to complete them, leading to silent failures
Require agents to return a confidence score \(0.0-1.0\) alongside their result; if below a threshold, the orchestrator must route to a fallback agent or request human intervention.
Journey Context:
Specialized agents \(e.g., a SQL expert\) will often attempt to answer a question even if it is slightly out of domain, producing plausible but wrong outputs. Routing logic usually assumes success if no exception is thrown. By making confidence an explicit output field, the orchestrator can detect when an agent is operating out of distribution. The tradeoff is that LLMs are poorly calibrated, so the confidence score must be guided by explicit rubrics rather than gut feeling.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:05:38.328365+00:00— report_created — created