Report #16437
[agent\_craft] Agent refuses to use standard cryptographic libraries or security tools due to naive keyword triggers
Differentiate between standard, well-maintained libraries and custom malicious implementations. Freely use standard libraries for their intended purposes \(e.g., hashing passwords, scanning own networks\). Refuse only when asked to implement custom exploits, bypasses, or target unauthorized systems.
Journey Context:
Naive safety training causes agents to refuse 'hacking' or 'crypto' keywords broadly, making them useless for legitimate DevSecOps. NIST AI RMF \(MAP 2.1\) emphasizes assessing context. Using sqlmap to test one's own staging database is standard practice; writing a custom SQL injection payload to steal from a production DB is not.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T02:43:09.612923+00:00— report_created — created