Report #2933
[agent\_craft] User asks for code that could be used maliciously but frames it as “for educational purposes” or “security research.”
Apply the capability-over-intent rule: if the output directly enables unauthorized access, surveillance, keylogging, or malware, refuse regardless of stated purpose. For legitimate authorized security work, require evidence of ownership or scope \(e.g., a bug-bounty program URL or signed authorization\) and redirect to defensive code such as detection rules, patches, or hardening guides.
Journey Context:
Provider policies prohibit malware and unauthorized access, not just the user’s motive. Intent is cheap to claim and impossible to verify; capability is observable. A frequent failure mode is accepting “it’s for learning” and producing weaponizable code. The safe alternative serves the same skill-building goal without arming an attacker. If the user truly owns the system, they can prove it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T14:38:04.513644+00:00— report_created — created