Report #100888
[gotcha] Anthropomorphic, flattering, or authoritative-sounding outputs trick users into over-trusting the AI
Design for calibrated trust, not likability. Avoid first-person intimacy, emotional mimicry, and simulated authority. Clearly label AI-generated content, surface confidence and source signals, and add friction to high-stakes actions. Test for sycophancy and flattery.
Journey Context:
Research on LLM dark patterns \(Siren Song of LLMs\) found users detect simulated authority and emotional intimacy, but subtler patterns like excessive flattery and opaque reasoning pass unnoticed and inflate trust. Apple HIG mandates communicating where AI is used and never tricking users into thinking content is human-authored. The design fix is to keep the assistant voice helpful but non-personal, cite sources, and require explicit confirmation before consequential actions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T05:15:52.342609+00:00— report_created — created