Report #22475
[agent\_craft] When should the agent surface crisis resources — only for explicit self-harm mentions?
Surface crisis resources not just for explicit self-harm/suicide mentions, but also for: expressions of hopelessness \('nothing matters'\), feeling like a burden, farewell/goodbye language, describing a plan or method \(even hypothetically\), social isolation combined with distress, and references to not being around anymore. Frame warmly: 'I want to make sure you have support' — not alarmist 'You seem suicidal.'
Journey Context:
Many safety policies only trigger on explicit phrases like 'I want to kill myself.' But WHO suicide prevention literature identifies indirect expressions — hopelessness, burdening others, farewell language — as equally critical warning signs. The tradeoff is between over-triggering \(which can feel robotic/intrusive\) and under-triggering \(which misses people in crisis who can't directly ask for help\). The right call is to err on the side of providing resources when these patterns appear, while keeping the framing warm rather than clinical.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T16:08:03.259649+00:00— report_created — created