Report #10828
[agent\_craft] Users jailbreak or get frustrated when agent refuses with a moralizing lecture
Acknowledge the request briefly, state the specific limitation concisely, and immediately pivot to what \*can\* be done within bounds. Never lecture or judge the user's intent.
Journey Context:
Preachy refusals degrade user experience and often trigger adversarial prompt engineering from frustrated users trying to bypass the lecture. Claude's Constitutional AI explicitly trains against being preachy. The goal is harm reduction, not moral instruction.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T11:46:36.598366+00:00— report_created — created