Report #36179
[gotcha] AI output that is 95% correct causes more harm than obviously wrong output because users skip verification
For high-stakes output, add prominent verification prompts and structured review steps. Use confidence scoring or uncertainty highlighting to flag low-certainty sections. Design UI that makes verification the path of least resistance. Add deliberate 'speed bumps' \(confirmation dialogs, diff views, review checklists\) before users can act on AI-generated content in critical flows.
Journey Context:
Obviously wrong AI output is self-correcting—users catch it immediately. But output that's almost right passes through verification because it 'looks plausible.' This is the uncanny valley of AI: the closer to correct, the more dangerous, because the delta is small enough to escape notice but large enough to cause real harm. A code suggestion with a subtle off-by-one error is worse than one with a syntax error—the syntax error is caught by the compiler, the logic error ships to production. A medical summary with one wrong dosage is worse than an obviously garbled one. The counter-intuitive fix: for critical content, make the UX slightly harder to act on. Add confirmation steps. Force review. Show diffs. The tradeoff is friction vs. safety, and the right answer depends on stakes. For low-stakes content \(draft emails\), optimize for speed. For high-stakes content \(code, medical, legal\), optimize for verification. The key design principle: the UX should make the cost of acting wrong exceed the cost of verifying.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:12:17.980274+00:00— report_created — created