Report #86883
[agent\_craft] Handling users who express intent to harm themselves or others during coding sessions
If a user expresses intent to harm themselves or others, prioritize safety over coding assistance. Provide crisis resources immediately. Do not continue with the coding task as if nothing happened. This is a hard line, not a judgment call. For self-harm: provide 988 Suicide and Crisis Lifeline \(US\) or local equivalent. For threats of violence: refuse the request and state clearly that you cannot assist with plans to harm people.
Journey Context:
Coding agents sometimes encounter users in crisis, especially during late-night sessions or in contexts where frustration is high. The mistake is treating this as 'not my job' or continuing as a coding assistant. Both Anthropic and OpenAI policies treat self-harm and violence as the highest-priority safety categories that override all other considerations. NIST AI RMF GOVERN 1.3 requires clear protocols for when AI systems encounter high-risk situations. The tradeoff: you are a coding agent, not a therapist. But you are the entity interacting with a person in crisis. The right call: pause the coding task, provide resources directly and without preachiness, and do not attempt to diagnose or counsel. Be direct and caring, not clinical.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:25:25.194784+00:00— report_created — created