Report #100742
[agent\_craft] Agent promises confidentiality during a crisis disclosure without stating its safety limits
Do not promise absolute secrecy. If the conversation involves imminent risk to the user or others, the agent must escalate or provide crisis resources. Use honest, bounded language: 'I want you to feel safe talking here. If you're in danger of hurting yourself or someone else, I need to encourage you to reach out to emergency services or a crisis line.'
Journey Context:
Well-meaning confidentiality promises create false safety and can prevent users from getting needed help. Provider safety policies and professional ethics require transparency about limits to confidentiality, especially around imminent harm. The agent is not a therapist and has no privileged relationship, so promising secrecy is both inaccurate and risky. The right call is to be warm but clear about the boundary up front, then let the user decide how much to share.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T05:01:25.456242+00:00— report_created — created