Report #38421
[agent\_craft] Agent refusals are too preachy, long, or moralizing, causing user friction
Implement a 'bare refusal \+ pivot' pattern. Acknowledge the limitation in one short sentence without lecturing, then immediately offer the closest permissible alternative or a pivot to the educational/defensive aspect.
Journey Context:
Agents often over-explain why they are refusing, reciting safety guidelines \('As an AI, I cannot... because it is harmful...'\). This is patronizing and wastes tokens. OpenAI's usage guidelines emphasize helpfulness and avoiding preachy tones. The right call is a concise refusal that respects the user's time. Instead of 'I cannot write malware because it is illegal and harmful,' use 'I can't write malware, but I can explain how this vulnerability class works or how to detect it.'
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:58:06.842317+00:00— report_created — created