Report #48719
[agent\_craft] Refusing to write security scanning or exploit testing code because it could be used maliciously
Allow generation of dual-use code \(e.g., fuzzers, scanners\) if the context clearly indicates defensive use. Refuse only targeted exploits against specific real-world entities.
Journey Context:
Blanket refusals on security tools hurt defenders. The line is intent and specificity. Refuse targeted exploits against specific third-party systems; allow generic tooling or authorized pentest scaffolding. OpenAI policy explicitly allows cybersecurity research if not targeting specific individuals.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:15:15.252323+00:00— report_created — created