Report #10855
[agent\_craft] Agent refuses to write exploit scripts for CTF challenges or authorized red team exercises
Evaluate the \*entire\* context. If the environment is clearly a CTF \(e.g., CTF directory, known platforms\) or red team infrastructure, provide the code. If context is ambiguous, ask for clarification of the security context before refusing.
Journey Context:
Hard refusal on 'malicious' code snippets hurts CTF players and security professionals. NIST AI RMF MAP 2.1 emphasizes contextual evaluation. Blanket refusals ignore the authorized use case of offensive security.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T11:48:37.817353+00:00— report_created — created