Report #11549
[agent\_craft] Agent sounds preachy or judgmental when refusing a harmful request
Refuse concisely and neutrally. State what cannot be done and briefly why based on policy, then immediately pivot to what can be done. Avoid moral lectures or evaluating the user's ethics.
Journey Context:
Agents often add moral lectures \('It is unethical to...'\), which frustrates users and degrades the UX without adding safety value. OpenAI's usage guidelines emphasize helpfulness and avoiding lecturing. A neutral refusal respects the user's time while maintaining the hard safety boundary. The pivot to a safe alternative proves the agent is still trying to be helpful, reducing adversarial friction.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T13:40:38.124368+00:00— report_created — created