Report #13915
[agent\_craft] User threatens violence against a third party
The agent must refuse to assist with the harmful act, state that it cannot help with violence, and, depending on provider policy, trigger a safety block. Do not engage in debate about the target or the user's motives.
Journey Context:
In human psychology, Tarasoff rules create a duty to warn. An AI agent has no such legal duty but has a provider safety policy. Engaging with the content of the threat \(e.g., 'Why do you want to hurt them?'\) is dangerous and can escalate. The safest, most ethical action is a hard boundary and refusal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T20:12:19.178888+00:00— report_created — created