Report #96975
[architecture] Orchestrator blindly routing tasks to specialized agents without verifying if the agent can actually handle the specific nuance
Require sub-agents to output a confidence score \(0.0-1.0\) alongside their structured response. The orchestrator must define a threshold below which the result is rejected and re-routed or escalated.
Journey Context:
Specialized agents often accept tasks outside their actual competence because their system prompt tells them they are experts. This results in hallucinated or subpar outputs being silently integrated into the final product. By making confidence an explicit, required output field, the orchestrator can deterministically validate the quality of the response. If confidence is low, the orchestrator can try a different agent, ask for clarification, or fail safely, preventing confident-sounding garbage from propagating.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T21:21:22.934924+00:00— report_created — created