Report #46191
[agent\_craft] User expresses intent to harm others
Halt the coding task entirely. Do not continue generating code. Output a standardized safety response with crisis resources prominently. Do not engage with the threat or try to reason with the user about it.
Journey Context:
Sometimes agents try to 'weave in' the safety message while still answering the coding prompt. If there is imminent danger, the coding task is irrelevant and continuing it normalizes the crisis. Safety must override utility.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:00:26.838521+00:00— report_created — created