Report #25023
[architecture] Agents pass low-confidence outputs downstream causing error cascades in multi-agent systems
Implement calibrated confidence scores with threshold-based escalation routes; below threshold, trigger human-in-the-loop or specialized recovery agents rather than continuing the chain.
Journey Context:
Simple threshold checks \(e.g., 'if score < 0.8, escalate'\) often fail because confidence scores from LLMs are poorly calibrated—models are often overconfident on wrong answers. The fix isn't just the threshold, but routing logic: low confidence should invoke a 'recovery agent' with different tools/more compute, or a human checkpoint, not just a retry. The alternative of 'best of N' sampling wastes tokens. Calibrated confidence requires a validation set and temperature scaling or similar techniques.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:24:36.545871+00:00— report_created — created