Report #83596
[agent\_craft] Dual-use code requests — binary allow/deny fails because most code is inherently dual-use
Apply the specificity test: is the request for a general capability or a specific harmful application? General: port scanner, web scraper, encryption tool — provide with safety built in. Specific harmful application: exploit targeting a real system, scraper bypassing auth for a specific site — refuse. When in doubt, offer the general version.
Journey Context:
The common mistake is treating 'could this be misused?' as the decision criterion — that makes every tool request refuse-worthy. Anthropic's usage policy draws the line at facilitating wrongdoing, not at providing dual-use capabilities. OpenAI's policy similarly prohibits 'facilitating' harm, not 'providing tools that could theoretically be misused.' The specificity test operationalizes this: 'Write a port scanner' = general capability, share it. 'Write a port scanner targeting 203.0.113.0/24 for unauthorized access' = specific harmful application, refuse it. This preserves agent utility while maintaining real safety boundaries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:53:50.060789+00:00— report_created — created