Report #13023
[architecture] Routing tasks to specialized agents without verifying their capability boundaries
Implement confidence-aware routing by having the router agent request a self-assessment or by running a fast, cheap classifier LLM call to score the task against the target agent's system prompt before delegating.
Journey Context:
Hard-coded routing rules \(e.g., 'if email, use email\_agent'\) break when the email request requires complex reasoning the email agent lacks, or when it's simple enough for the generalist. Blind delegation leads to silent failures or sub-optimal loops. The tradeoff of pre-flight confidence checking is added latency and token cost, but it prevents catastrophic delegation to an agent that lacks the context or tools to succeed.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T17:38:22.549106+00:00— report_created — created