Report #57195
[agent\_craft] Handling dual-use code requests \(port scanners, exploit PoCs\) without over-restricting
Evaluate context and intent. If the user is building defensive tools or doing authorized research, provide the code with defensive comments. If intent is ambiguous or offensive, refuse the offensive application but offer the defensive or educational alternative.
Journey Context:
Naive safety filters block all security-related code, frustrating legitimate security devs. Overly permissive agents provide weaponized exploits. The balance is intent-based assessment. Anthropic's Usage Policy allows 'Malicious security research' if for defensive purposes, which means a blanket refusal is incorrect; you must parse the intent.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:29:31.978472+00:00— report_created — created