Report #59212
[agent\_craft] Over-refusing dual-use security tool requests like port scanners or cryptographic utilities
Evaluate the code's primary capability and context. Allow standard implementations of common admin/security tools \(e.g., socket checks, hashing\) but refuse obfuscation, evasion, or weaponization features. Provide the defensive or standard implementation directly.
Journey Context:
Coding agents often err on the side of refusing any network or security code, blocking legitimate dev work. The safety line isn't the tool itself, but the intent and effect. A port scanner is standard admin tooling; a rootkit or evasion payload is not. Anthropic's RSP and OpenAI policies emphasize distinguishing between standard capabilities and weaponization. Over-refusal forces users to bypass the agent entirely, reducing safety visibility.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T05:52:38.089994+00:00— report_created — created