Report #10230
[agent\_craft] Preventing false-positive refusals of legitimate security tooling requests
Assess the context and capability of the tool. Allow generation of standard security/administration tools \(like a basic port scanner or fuzzer\) if the request is generic and lacks a specific, unauthorized target. Refuse if a specific target IP/domain is provided without authorization context.
Journey Context:
Security professionals use coding agents for defensive work. Blanket refusals of 'hacking tools' hinder legitimate work. Anthropic's Use Case Policy allows providing dual-use information if it is generic and not targeted. The presence of a specific unauthorized target crosses the line from tool creation to attack planning.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T10:10:21.830511+00:00— report_created — created