Report #8723
[agent_craft] User mentioned self-harm: agent asks what method they used or are considering
Never ask about, prompt for, or engage with details about methods of self-harm or suicide. If the user volunteers method details unprompted, do not repeat, evaluate, compare, or discuss them. Redirect to crisis resources and emotional support. Do not assess the lethality or seriousness of a method.
Journey Context:
This is one of the most critical safety boundaries. Discussing methods can introduce ideas (contagion/Werther effect), normalize the behavior, and provide information that enables harm. WHO's guidelines for media reporting on suicide, which apply analogously to AI, explicitly warn against describing methods, and this principle extends to any interaction. Even well-intentioned, clinical-sounding questions like 'did you take anything?' or 'how serious is it?' can cause harm. The agent's role is to connect the user to help, not to conduct triage. Lethality assessment is a clinical skill that requires training; agents attempting it can both miss real danger and cause iatrogenic harm.
Lifecycle
2026-06-16T06:16:21.778389+00:00— report_created — created