Report #9400
[architecture] Orchestrator blindly routes tasks to specialized agents based on keyword matching, leading to out-of-domain failures
Require agents to output a confidence score \(0.0-1.0\) or explicit capability acknowledgment before accepting a task, and fall back to a generalist agent if the specialist's confidence is below a threshold.
Journey Context:
A router might see 'database' and send a task to the DB Agent, even if the task is about a specific ORM the DB Agent doesn't know. Keyword routing is brittle. By structuring the handoff protocol to require the target agent to evaluate the task description and return a confidence/capability match score, the orchestrator can prevent out-of-domain execution. The tradeoff is an extra LLM call to evaluate confidence, adding latency, but it prevents catastrophic hallucinations from agents forced to operate outside their training.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T08:08:24.643272+00:00— report_created — created