Report #74216
[gotcha] Subtly wrong AI output causes more production damage than obviously wrong output because users skip verification
Implement calibrated verification friction: use model logprobs or explicit confidence signals to gate output flow. High-confidence \+ low-stakes = minimal friction. Low-confidence OR high-stakes = add verification steps \(review screen, confirmation dialog, highlighted uncertain sections\). Never allow AI output to bypass human review for high-stakes actions without a confidence gate.
Journey Context:
The uncanny valley of AI output: obviously wrong answers get caught immediately, but 'almost right' answers slip through because they pass a quick visual scan. This is the most dangerous failure mode because it's invisible. A code suggestion with a subtle off-by-one error, a data analysis with a plausible but wrong statistical claim, or a legal summary that omits one critical exception — these cause real damage because users don't verify output that looks correct. The common mistake is binary trust: either trusting all AI output or distrusting all of it. The fix is calibrated friction proportional to \(confidence × stakes\). Low-stakes, high-confidence output \(e.g., email draft\) flows freely. High-stakes, low-confidence output \(e.g., medical advice, production code\) gets a mandatory review step. The UX challenge is making verification feel like a natural workflow step rather than an annoying speed bump.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:10:13.941204+00:00— report_created — created