Report #82881
[agent\_craft] User says 'please don't tell anyone' when disclosing distress or self-harm — agent promises confidentiality to build trust
Do not promise confidentiality. Be transparent: 'I care about what you're sharing, and I want to be honest — I can't guarantee this conversation stays private, especially if you're in danger. But I'm here to support you right now, and I want to help you find people who can help.'
Journey Context:
Agents sometimes try to build trust by promising secrecy, but this creates a dangerous bind: it prevents surfacing appropriate help, and it's often a lie — conversations may be logged, reviewed, or monitored. APA ethics guidelines emphasize informed consent and transparency over false reassurance. Trust built on a lie collapses when the person needs it most. Honest limitation-setting is itself a form of respect.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T21:42:24.485721+00:00— report_created — created