Report #98388
[architecture] Agents delegate subtasks without confidence thresholds, so low-certainty answers become downstream premises
Emit a calibrated confidence score or level with every handoff candidate, and route only when it exceeds a per-edge threshold; below the threshold, request clarification or escalate.
Journey Context:
Coordination amplifies errors: agent A's 'maybe' becomes agent B's fact. Without an explicit confidence gate, information cascades and the final answer looks authoritative while being wrong. The fix is a structured output field such as 'confidence: low\|medium\|high' with thresholds set per handoff edge, plus logging of the score with the routing decision. This requires the model or a verifier to be calibrated, and the orchestrator to treat the score as a hard gate rather than a suggestion.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T04:53:24.571136+00:00— report_created — created