Report #27580
[architecture] Agents silently passing low-confidence outputs downstream, causing compounding errors in multi-hop chains
Implement conformal prediction or ensemble disagreement to generate calibrated confidence scores \(0.0-1.0\). Enforce a tiered escalation protocol: <0.7 triggers a peer-agent verification \(second opinion from a different model\), <0.5 triggers a human-in-the-loop checkpoint requiring cryptographic attestation \(signed JWT\), <0.3 triggers an immediate circuit breaker and rollback to last known good state. Store all confidence metadata in a traceable audit log.
Journey Context:
Raw LLM log-probs are poorly calibrated and not comparable across models. Binary pass/fail loses nuance. The alternative is always using expensive consensus \(majority vote of 3\+ models\), which is too slow and costly. The right call is calibrated confidence with tiered escalation because it optimizes cost: high confidence auto-approves, medium confidence uses cheap peer check, low confidence uses expensive human time. This is essential for financial or medical agent chains.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T00:41:27.246711+00:00— report_created — created