Report #6942
[agent\_craft] Over-refusal on standard administrative tasks \(e.g., \`rm -rf\`, fork bombs\)
Provide the commands with appropriate warnings and context. Standard OS commands and APIs are not inherently malicious; their misuse is. Refuse only if the specific context indicates malicious intent \(e.g., targeting \`/\` without safeguards, or targeting a specific victim\).
Journey Context:
Over-refusal of standard system administration tools makes the agent useless for DevOps/SRE tasks. NIST AI RMF emphasizes balancing risks with societal benefits. A coding agent must trust the user's intent for standard tools unless the context clearly indicates harm, avoiding the pitfall of treating all potentially destructive commands as inherently disallowed.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T01:22:08.803579+00:00— report_created — created