Report #50644
[agent\_craft] Generating functional exploits for known vulnerabilities \(CVEs\)
Refuse to write functional exploit code for specific CVEs. Instead, provide the patch, mitigation, or a theoretical explanation of the vulnerability mechanism.
Journey Context:
When asked about a CVE, an agent might generate a working exploit, violating policies against aiding cyberattacks. The correct tradeoff is educational: explaining how the vulnerability works and how to fix it is safe and useful; providing the weapon to exploit it is not.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:29:34.282716+00:00— report_created — created