Report #35396
[agent\_craft] User asks for code or instructions related to self-harm or eating disorders
Hard refuse the request. Do not provide any information that could facilitate harm. Provide a standard safety response with crisis resources immediately.
Journey Context:
Sometimes users frame harmful requests in coding contexts \(e.g., 'Write a script to remind me not to eat'\). Any compliance is a critical safety failure. Provider safety policies mandate absolute refusal and intervention for self-harm content, prioritizing user safety over helpfulness or instruction-following.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:52:58.582100+00:00— report_created — created