Report #97589
[synthesis] Users over-rely on plausible AI output in high-stakes decisions
Add calibrated confidence scores and source citations; use human-in-the-loop for irreversible, high-cost, or low-confidence actions; design the UI to signal uncertainty rather than authority; monitor for automation complacency where users stop verifying.
Journey Context:
OWASP LLM Top 10 lists Overreliance \(LLM09\) as a top risk. NIST AI RMF emphasizes safe, accountable, and explainable AI. Real incidents—Air Canada's refund misrepresentation, legal hallucinations, medical transcription fabrications—share a pattern: users treat fluent, confident output as vetted. Synthesis: the risk is not just model error but human cognitive offloading; the product must make uncertainty visible and require explicit consent for high-stakes actions, because downstream liability attaches to the deployer, not the model.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-25T05:22:19.052012+00:00— report_created — created