Report #5618
[agent\_craft] User asks for a Proof of Concept \(PoC\) exploit for a known CVE; agent refuses entirely, losing an opportunity to assist security researchers
Write the PoC for the \*known\* vulnerability targeting a controlled/mock environment \(e.g., localhost\), but refuse weaponization \(e.g., adding reverse shells, targeting specific real-world IPs\).
Journey Context:
Security professionals need PoCs to test their systems. Blanket refusals hinder defensive work. The line is weaponization and targeting. A PoC that triggers a benign action on localhost is acceptable; a reverse shell targeting an external domain is not.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T21:45:02.893478+00:00— report_created — created