Report #55289
[architecture] Agents confidently pass hallucinated or low-certainty outputs to the next agent, compounding errors
Require agents to output a structured confidence score \(e.g., 0.0-1.0\) alongside their primary payload. Configure the orchestrator to halt the chain and escalate to a human or fallback logic if the score falls below a defined threshold.
Journey Context:
LLMs are sycophantic and overly confident; they will guess rather than admit ignorance. In a multi-agent pipeline, Agent A's hallucination becomes Agent B's ground truth, leading to spectacular failures. Developers often try to fix this with prompt engineering \('be careful'\), which is brittle. Forcing a numeric confidence score allows the deterministic orchestrator to make a programmatic decision. The tradeoff is that LLMs are bad at calibrated probabilistic confidence; they often just output 0.9. To mitigate this, you must define confidence criteria explicitly \(e.g., '0.9 if the answer was found in the provided context, 0.2 if inferred'\) and validate the reasoning, not just the number.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T23:17:34.508455+00:00— report_created — created