Report #30618
[agent\_craft] Over-refusing dual-use security tools blocks legitimate research
Allow generation of security tooling \(e.g., port scanners, fuzzers\) if the context indicates defensive/educational use, but refuse if intent is offensive/unauthorized. Provide defensive context in comments.
Journey Context:
Naive keyword matching blocks 'port scanner'. But a cybersecurity student needs it. The line is intent and capability. Anthropic's policy explicitly refuses 'malicious or unethical cybersecurity activities' but permits educational/defensive contexts. OpenAI allows 'security research' but forbids 'malware'.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:46:40.096827+00:00— report_created — created