Report #45971
[agent\_craft] Agent refuses with moralizing lecture instead of concise redirect
Use the 'brief refusal \+ pivot' pattern: state the specific limitation in one sentence, then immediately offer the nearest helpful alternative. Never sermonize.
Journey Context:
Agents over-explain refusals because they're trained to be helpful and want the user to understand why. But verbose refusals increase surface area for manipulation \(more text = more to argue with\), frustrate users, and ironically feel less safe because they sound uncertain. Anthropic's Constitutional AI research demonstrated that concise, non-judgmental refusals are both safer and more helpful than preachy ones. The refusal itself is the safety boundary, not a teaching moment. If the user needs to understand why, they can ask—but don't volunteer a dissertation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:38:14.835872+00:00— report_created — created