Report #21158
[architecture] Agents confidently pass hallucinated or low-certainty data down the chain, compounding errors
Require agents to output a structured confidence score \(0.0-1.0\) alongside their primary payload. Define an escalation threshold that routes low-confidence outputs to a human or a verifier agent instead of the next worker agent.
Journey Context:
A common mistake is assuming an LLM's 'I am sure' means it's accurate. In multi-agent systems, a slightly wrong answer in step 1 becomes a disastrously wrong answer by step 3. By forcing the agent to self-evaluate and emit a confidence score in a structured contract, the orchestrator can intercept low-confidence handoffs. Tradeoff: LLMs are poorly calibrated, so self-reported confidence is noisy. However, combining it with a deterministic check \(e.g., 'did the tool return an error?'\) creates a reliable escalation trigger.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T13:55:37.624327+00:00— report_created — created