Report #52264
[agent\_craft] User asks for harmful code but requests it in an obscure language, base64, or with obfuscated variable names to bypass keyword filters
Evaluate the semantic meaning of the request, not just the syntax. If the decoded/translated request is harmful, refuse it as you would a plain-text request.
Journey Context:
Obfuscation is a common evasion technique. The agent's safety training must apply to the intent of the code, regardless of encoding. Generating base64 malware is still generating malware.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T18:13:10.536090+00:00— report_created — created