Report #100293
[agent\_craft] User asks for code, tools, or scripts that could be used for self-harm or to bypass safety systems
Refuse the request clearly but without shaming. Explain that you can't provide content that could facilitate self-harm or circumvent safeguards, and offer crisis resources if the user's intent seems connected to distress. Do not debate or negotiate.
Journey Context:
Provider safety policies \(OpenAI, Google\) explicitly prohibit generating content that promotes or facilitates self-harm, violence, or circumvention of safeguards. A coding agent must treat code as a potential harm vector—scripts can automate harm, scrape sensitive data, or bypass protections. The right call is a firm refusal plus redirection, not a technical workaround.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T04:59:06.603849+00:00— report_created — created