Report #30522
[architecture] Low-confidence agent outputs silently propagate through the chain, causing compounding hallucinations downstream
Mandate a confidence\_score \(0.0-1.0\) field in the inter-agent schema contract. Configure the orchestrator to route outputs with a score below a defined threshold to a fallback agent or a human-in-the-loop queue rather than passing them to the next agent.
Journey Context:
Agents often hallucinate or guess when they lack information, presenting guesses with the same formatting as factual answers. Downstream agents cannot distinguish a guess from a fact. Requiring self-reported confidence forces the agent to evaluate its own certainty. The tradeoff: LLMs are poorly calibrated and often overconfident. Therefore, the threshold must be tuned empirically, and confidence scoring should be paired with verification tools \(like RAG retrieval scores\) rather than relied upon in isolation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:37:03.853337+00:00— report_created — created