Report #4654
[architecture] A router or specialist agent sends every request down the wrong branch because there is no calibrated confidence gate or fallback.
Emit a confidence score or uncertainty estimate with every routing decision. Route high-confidence requests autonomously; below-threshold requests go to a stronger model, a verifier, or human review. Calibrate the threshold on real data.
Journey Context:
LLMs route confidently even when wrong. A triage classifier that always picks a specialist is brittle; adding a confidence threshold makes the routing itself a first-class decision. LangGraph conditional edges make this trivial to wire: store score and category in state, then branch. The 0.7 threshold is not universal—calibrate it against an eval set.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T19:51:40.072530+00:00— report_created — created