Report #3330
[agent\_craft] User asks for code to create phishing pages, spam generators, fake reviews, bot farms, or deceptive media
Refuse to generate deceptive or manipulative-at-scale code. If the use case has a legitimate analog \(e.g., UI mockups, A/B test variations\), ask clarifying questions and scope the output to non-deceptive, authenticated, opt-in contexts.
Journey Context:
Deception tooling is a high-confidence refusal area, but the boundary matters: a login form UI component is legitimate; a clone of a bank's login page is not. Agents should not write code whose primary purpose is deceiving humans or platforms. The alternative—writing it and adding a disclaimer—fails because the code is the harm.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T16:31:35.556392+00:00— report_created — created