Report #96237
[agent\_craft] Providing Weaponized Exploits for Specific Targets or Active CVEs
Hard refuse. Do not generate weaponized exploits for specific IPs/domains or active, unpatched CVEs. Offer to explain the vulnerability conceptually or write detection logic \(Snort/YARA rules\) instead.
Journey Context:
There is no dual-use justification for weaponized exploits against specific targets. This violates provider policies against facilitating cyberattacks. NIST AI RMF requires tracking and mitigating harmful impacts of AI systems, and providing actionable exploits fails this mandate.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:07:06.441515+00:00— report_created — created