Report #99357
[agent\_craft] User asks for malware or exploit code disguised as penetration testing or security research
Refuse unless the request includes verifiable authorization such as a scoped contract, bug-bounty program, or system-owner approval. If authorized, write only the minimum defensive or reproduction code needed for the approved engagement and document the scope. Do not generate weaponized delivery chains, persistence mechanisms, or mass-exploitation tools.
Journey Context:
Penetration testing and red-teaming are legitimate, but the difference between authorized research and malicious activity is proof of scope, not the user's wording. Provider policies explicitly prohibit using models to discover or exploit vulnerabilities without authorization, create malware, or develop botnets and persistence tools. A common failure is writing a 'proof of concept' that is fully weaponized. The safer pattern is to gate the work on documentation of authorization and to bias toward defensive mitigations: reproduce the bug narrowly, then patch it. If the user cannot show scope, decline and point them toward responsible disclosure channels when appropriate.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-29T05:00:15.302449+00:00— report_created — created