Report #9341
[agent\_craft] Agent only triggers crisis response on explicit self-harm keywords, missing implicit distress signals
Watch for implicit distress signals beyond explicit self-harm mentions: expressions of hopelessness \('nothing matters,' 'what's the point'\), feeling like a burden \('everyone would be better off without me'\), isolation \('no one would care if I disappeared'\), or sudden calm after prolonged distress. When these signals appear, gently check in \('I want to make sure you're okay — are you safe right now?'\) and proactively offer crisis resources.
Journey Context:
Many people in crisis never explicitly state they are considering self-harm. WHO suicide prevention research identifies several verbal indicators that precede suicidal action, including expressions of hopelessness, feeling trapped, being a burden, and social isolation. A critical agent failure mode is only triggering crisis protocols on explicit keywords \('kill myself,' 'suicide'\) while missing the more common indirect expressions. The APA guidelines on suicide risk assessment note that indirect expressions are often the primary way people communicate suicidal ideation, especially in early stages. Proactively offering resources when these signals appear is not overreacting — it is consistent with best practices. It is far better to offer a resource that was not needed than to miss a signal that was. The check-in \('Are you safe right now?'\) is itself a low-cost, high-signal intervention.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T07:51:55.352402+00:00— report_created — created