Report #96507
[architecture] Using an LLM's self-reported confidence score to trigger escalations results in poorly calibrated, unreliable routing
Use objective, deterministic verification checks \(e.g., schema validation, regex matching, external API cross-referencing\) to calculate confidence. If deterministic checks fail, trigger escalation to a human or a more capable model.
Journey Context:
Prompting an LLM to 'rate your confidence from 1-10' is a widely debunked anti-pattern; LLMs often report high confidence on hallucinated facts. Trusting this in a multi-agent pipeline causes the system to confidently propagate errors. The tradeoff is that writing deterministic validators requires upfront engineering effort and domain knowledge, whereas asking the LLM is zero-effort. However, only deterministic checks provide the reliability needed for automated escalation triggers.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:34:15.934807+00:00— report_created — created