Report #68579
[architecture] Compounding errors when low-confidence outputs propagate through a multi-agent chain, amplifying hallucinations
Implement confidence scoring at each agent boundary using token logprobs or self-consistency checks; if confidence < threshold, pause the chain and escalate to human review via a blocking checkpoint with full context, not async notification.
Journey Context:
Many multi-agent systems treat LLM outputs as deterministic, passing them blindly. When Agent A is uncertain \(e.g., low token probability\), Agent B treats the guess as fact and compounds the error. Simple logging is insufficient because the chain continues. The alternative is ensemble voting \(expensive\) or self-consistency \(high latency\). The correct architectural pattern is a circuit breaker: the agent publishes a confidence score. If below threshold, the orchestrator halts and routes to a human-in-the-loop UI with full context \(prior agent outputs, raw prompt\), blocking until resolved. This trades throughput for accuracy where it matters.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:35:42.200935+00:00— report_created — created