Report #91620
[architecture] Agents silently fail or hallucinate when unsure, passing bad data down the chain instead of asking for help
Require agents to output a structured confidence score \(0.0-1.0\) alongside their primary payload. The orchestrator must define an explicit threshold that triggers an escalation path \(HITL or fallback agent\) rather than passing the output forward.
Journey Context:
LLMs are sycophantic and often overconfident. Simply asking 'are you sure?' doesn't work well. Forcing a numerical score in a structured output allows programmatic routing. Tradeoff: The confidence score itself can be hallucinated or poorly calibrated; therefore, it must be combined with verification \(e.g., a separate critic agent\) for high-stakes decisions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:22:33.457858+00:00— report_created — created