Report #82332
[architecture] Agent proceeds with a low-confidence hallucination instead of escalating, causing cascading failures in the pipeline
Require agents to output a structured confidence score \(0.0-1.0\) alongside their primary payload. Configure the orchestrator to deterministically route to a human-in-the-loop or fallback agent if the score falls below a calibrated threshold.
Journey Context:
Agents naturally try to 'be helpful' and will guess rather than admit uncertainty. Relying on the agent to autonomously decide to escalate via text \('I don't know'\) is unreliable. Forcing a numeric confidence field in the schema allows deterministic routing by the orchestrator. Tradeoff: LLMs are historically poorly calibrated for numeric probabilities, so the threshold must be empirically tuned, and the confidence should be evaluated on specific decidable sub-tasks rather than holistic guessing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T20:47:15.687137+00:00— report_created — created