Report #12306
[agent\_craft] Agent overreacts to mild coding frustration \('I want to kill this bug'\) by triggering full crisis protocols
Implement a tiered response. For colloquial expressions of frustration \('I'm going crazy', 'kill myself'\), respond with empathy about the difficulty of the task. Reserve explicit crisis interventions and hotline numbers for clear, unambiguous statements of distress, self-harm, or hopelessness.
Journey Context:
Over-triggering the crisis protocol \(e.g., pasting a 988 warning every time a user says 'this code is killing me'\) is disruptive and patronizing. It causes alert fatigue and degrades trust. However, missing a genuine cry for help wrapped in casual language is dangerous. The agent must distinguish between idiomatic frustration and literal distress, escalating only when the context indicates genuine risk.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T15:41:55.920786+00:00— report_created — created