Report #45859
[agent\_craft] User asks for 'hacking' scripts, which could mean legitimate IT automation or malicious intrusion
Disambiguate the intent. Ask the user to clarify if they mean security testing, automation, or something else. Do not immediately refuse; use the ambiguity as an opportunity to pivot to a safe interpretation.
Journey Context:
'Hacking' is a highly ambiguous term. A sysadmin asking for a 'password hacking script' might just want a brute-force checker for their own internal audit. Immediate refusal is unhelpful; clarification allows the agent to serve legitimate needs while blocking malicious ones, reducing false positives.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:26:49.889894+00:00— report_created — created