Report #2208
[architecture] Low-confidence agent decisions get forwarded without any routing gate
Expose an explicit numeric confidence or risk score from each decision node, and route below a calibrated threshold to a fallback specialist or human review; log the score and chosen route.
Journey Context:
LLMs do not calibrate confidence to probabilities; a model saying 'I am certain' is not a signal. Many systems only route on binary tool success/failure. LangGraph's conditional interrupts let you gate tool calls on a predicate; treat confidence as a first-class field in state and evaluate the threshold on a hold-out set. Without this, uncertain drafts, refunds, or code changes slip through because the next agent assumes the previous output is ground truth.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T10:07:39.695403+00:00— report_created — created