Report #85099
[architecture] Orchestrator routes a task to a specialized agent that lacks the knowledge to handle it, resulting in hallucinations or silent failures
Have agents return a confidence score (e.g., 0.0-1.0) alongside their response; the orchestrator re-routes to a fallback agent if confidence is below a threshold.
Journey Context:
Agents rarely say 'I don't know' unprompted and will attempt tasks outside their expertise. Requiring an explicit confidence score forces the agent to self-assess. If the score is low, the orchestrator can escalate to a generalist or a different specialist. The tradeoff is that LLM self-reported confidence is notoriously poorly calibrated, so the score must be combined with strict output validation rather than relied upon blindly.
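The routing described above might be sketched as follows. This is a minimal illustration, not a reference implementation: `AgentResult`, `route`, the 0.7 threshold, and the toy agents and validator are all hypothetical names and values chosen for the example. The key point is that a result is accepted only if it clears the confidence threshold and also passes an independent validation check.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass
class AgentResult:
    answer: str
    confidence: float  # self-reported, 0.0-1.0; a weak signal, not a guarantee


# Hypothetical type aliases: an agent maps a task to a result,
# a validator checks the answer independently of the agent's own score.
Agent = Callable[[str], AgentResult]
Validator = Callable[[str], bool]


def route(
    task: str,
    agents: Sequence[Agent],
    validate: Validator,
    threshold: float = 0.7,  # assumed value; tune per deployment
) -> Optional[AgentResult]:
    """Try agents in order (specialist first, fallbacks after).

    Accept the first result that clears the threshold AND passes validation.
    """
    for agent in agents:
        result = agent(task)
        if result.confidence >= threshold and validate(result.answer):
            return result
    # No acceptable answer: return None so the caller can surface the
    # failure explicitly instead of passing along a hallucinated result.
    return None


# Toy agents standing in for real LLM calls.
def unsure_specialist(task: str) -> AgentResult:
    return AgentResult(answer="maybe DROP something?", confidence=0.3)


def overconfident_specialist(task: str) -> AgentResult:
    # High confidence but malformed output: validation must catch this.
    return AgentResult(answer="I think it is fine", confidence=0.95)


def generalist(task: str) -> AgentResult:
    return AgentResult(answer="SELECT 1;", confidence=0.9)


def looks_like_sql(answer: str) -> bool:
    # Deliberately simplistic validator for the example.
    text = answer.strip()
    return text.upper().startswith("SELECT") and text.endswith(";")


chosen = route(
    "write a trivial query",
    [unsure_specialist, overconfident_specialist, generalist],
    looks_like_sql,
)
```

Here the first agent is skipped for low confidence, the second is rejected by the validator despite its high score, and the generalist's answer is accepted. If every agent fails, `route` returns `None`, which turns a silent failure into an explicit one.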
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T01:25:17.875866+00:00— report_created — created