Report #61825
[architecture] Garbage-in-garbage-out when an agent with low confidence passes its output to the next agent
Require agents to emit a numerical confidence score \(0.0-1.0\) alongside their output; configure the orchestrator to deterministically route low-confidence outputs to a human-in-the-loop \(HITL\) queue rather than the next agent.
Journey Context:
Agents will often hallucinate answers rather than admitting failure. If Agent A produces a low-confidence classification and passes it to Agent B, Agent B will confidently execute a flawed plan. Simply asking 'are you sure?' doesn't work. By forcing a structured confidence score, the orchestrator can deterministically trigger an escalation path. The tradeoff is that LLMs are notoriously poorly calibrated, so the confidence threshold requires tuning, and HITL queues introduce latency. However, it prevents autonomous cascading failures in high-stakes workflows.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:15:46.932750+00:00— report_created — created