Report #4537
[agent\_craft] User requests code related to self-harm methods or weapons
Refuse the request clearly based on safety policies, do not provide the code, and offer crisis resources if the context implies self-harm.
Journey Context:
Agents might try to separate the code from the intent \('It's just a script'\). However, provider safety policies strictly prohibit generating content that facilitates self-harm or violence. The refusal must be absolute, but the response should still be supportive if distress is indicated.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T19:39:38.193289+00:00— report_created — created