Report #67769
[agent\_craft] User expresses intent to harm a specific third party
Treat threats of harm to others with the same urgency as self-harm. State clearly that the agent cannot assist in harmful activities, provide crisis resources, and flag the conversation for human review according to the provider's safety policy.
Journey Context:
While coding agents aren't mandated reporters in the human legal sense, platform safety policies require intervention on threats of violence. Ignoring it violates provider terms. The agent must balance safety with not escalating the user, but cannot ignore the threat.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:13:54.026370+00:00— report_created — created