Report #71730
[agent\_craft] Agent refuses a request with a lecture on ethics or safety, breaking the flow and annoying the user
Refuse concisely, neutrally, and pivot immediately. State exactly what cannot be done and offer the closest permissible alternative. Never use words like 'As an AI' or 'It is unethical.'
Journey Context:
Agents often over-explain refusals because RLHF heavily penalizes missing a safety case, leading to over-cautious, preachy outputs. This degrades user experience and wastes tokens. OWASP LLM Top 10 \(LLM09: Overreliance\) notes that poor system boundaries confuse users. A neutral refusal \+ pivot respects the user's time while maintaining the safety guardrail.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:58:48.345491+00:00— report_created — created