Report #21615
[architecture] Router agent dispatches a task to a specialist that lacks the required knowledge; the specialist fails silently or hallucinates an answer, and there is no mechanism to escalate
Require agents to output a confidence score or an explicit 'cannot complete' status alongside their result. If the confidence is below a threshold or the status is 'cannot complete', the orchestrator must route to a fallback agent (such as a more capable generalist model) or a human in the loop, rather than trying another automated specialist.
Journey Context:
Standard routing assumes the specialist can handle the task. But LLMs tend to be sycophantic: they will attempt an answer even when the task is outside their domain. Without a confidence-aware feedback loop, the orchestrator accepts bad answers as if they were valid. A confidence score lets the orchestrator detect when the automated pipeline has hit its limit, which prevents cascading hallucinations and allows graceful degradation to a human or a more capable generalist model.
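The routing rule described above can be sketched as follows. This is a minimal illustration under stated assumptions, not a reference implementation: the names `AgentResult`, `Status`, `needs_escalation`, and `orchestrate`, the 0.0 to 1.0 confidence scale, and the 0.7 threshold are all hypothetical and would need to be adapted to the actual agent framework.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Optional, Tuple


class Status(Enum):
    OK = "ok"
    CANNOT_COMPLETE = "cannot_complete"


@dataclass
class AgentResult:
    """Hypothetical result envelope every agent is required to return."""
    status: Status
    confidence: float  # self-reported; assumed to lie in [0.0, 1.0]
    answer: Optional[str] = None


# An agent is modeled as any callable that maps a task to an AgentResult.
Agent = Callable[[str], AgentResult]

CONFIDENCE_THRESHOLD = 0.7  # illustrative value only; tune per deployment


def needs_escalation(result: AgentResult,
                     threshold: float = CONFIDENCE_THRESHOLD) -> bool:
    """True if the agent declined or reported confidence below the threshold."""
    return (result.status is Status.CANNOT_COMPLETE
            or result.confidence < threshold)


def orchestrate(task: str,
                specialist: Agent,
                fallback: Agent,
                ask_human: Callable[[str], str],
                threshold: float = CONFIDENCE_THRESHOLD,
                ) -> Tuple[str, Optional[str]]:
    """Return (handler, answer), escalating specialist -> fallback -> human."""
    result = specialist(task)
    if not needs_escalation(result, threshold):
        return "specialist", result.answer

    # Do not try another specialist; go straight to the generalist fallback.
    result = fallback(task)
    if not needs_escalation(result, threshold):
        return "fallback", result.answer

    # Automated pipeline has hit its limit: hand off to a human.
    return "human", ask_human(task)
```

Returning the handler name alongside the answer lets the orchestrator log how often tasks escalate, which is one way to spot specialists that are routinely dispatched out of domain.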
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T14:41:47.062486+00:00 - report_created - created