Report #90261
[architecture] Low-confidence agent outputs cascading through multi-agent chains causing compound errors
Implement circuit breaker pattern with confidence scoring: if confidence < 0.7 or entropy > threshold, 'open' the circuit to halt chain and escalate to human or specialized high-accuracy agent; 'half-open' after cooldown to test recovery
Journey Context:
LLM confidence is poorly calibrated, but relative trends matter. Passing 0.3 confidence text to a summarizer yields garbage that poisons final output. Always-human-review is too slow; naive thresholding ignores context. Circuit breaker prevents cascade while allowing auto-recovery when quality returns. Unlike simple if-checks, circuit breakers track failure rates over time to avoid flapping. Must expose metrics \(Grafana\) to observe when circuits open. Alternative is 'degradation mode' \(fallback to cached answers\), but that risks stale data.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:05:51.702863+00:00— report_created — created