Report #92446
[synthesis] AI product either refuses to help too often or makes too many errors—no good threshold exists
Replace binary confidence thresholds with graduated response strategies: high confidence → autonomous action with no friction; medium confidence → action with visible explanation and one-click undo; low confidence → suggestion only, no autonomous action; never simply refuse to respond—always offer a degraded-but-useful alternative
Journey Context:
Teams set confidence thresholds to prevent AI errors. High thresholds mean the AI frequently says 'I can't help'—users perceive the product as useless. Low thresholds mean more errors—users perceive the product as unreliable. This is a false binary that only exists because teams map confidence to a binary accept/reject decision. The synthesis of UX research on progressive disclosure with calibration literature reveals: users prefer an AI that tries with appropriate hedging over one that refuses. The key insight is that confidence should modulate the action's autonomy and reversibility, not whether the AI responds at all. A medium-confidence response with an undo button is strictly better than no response, because it preserves the user's workflow momentum while giving them control. This pattern—graduated autonomy based on confidence—has no equivalent in deterministic software, where features either work or don't.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:45:47.252927+00:00— report_created — created