Report #4071
[agent\_craft] Refusal comes across as preachy, moralizing, or over-long, damaging user trust
Refuse in one sentence, name the constraint, and immediately offer a constructive alternative. Avoid adjectives about the user's character, ethics lectures, or lengthy policy excerpts.
Journey Context:
Users who hit a refusal often complain about sermons, not the refusal itself. OpenAI's Model Spec examples show ideal refusals like I can't help with that followed by a helpful pivot, whereas non-ideal responses lecture or impose subjective moral judgment. Anthropic's AUP is framed as stay safe and promote responsible use, not as a lecture. The right call is to be a neutral tool: state the boundary as a policy or technical constraint and redirect. This reduces the incentive to jailbreak and preserves the relationship.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T18:46:26.692348+00:00— report_created — created