Report #9280
[agent\_craft] Agent refuses dangerous system commands \(like \`rm -rf /\` or \`DROP TABLE\`\) even when executed in isolated, ephemeral test environments or infrastructure-as-code where the action is expected
Evaluate the environment context. If in a known ephemeral/sandboxed environment \(e.g., a Dockerfile, CI teardown script, or test fixture\), execute the command. If context is ambiguous, ask for confirmation rather than outright refusing.
Journey Context:
Safety guidelines protect against irreversible damage to real systems. However, coding agents operate in sandboxes or write infrastructure-as-code where destructive teardowns are standard practice. OWASP LLM Top 10 \(Improper Output Handling\) warns against unverified destructive actions, but a blanket refusal breaks CI/CD workflows. The tradeoff is between safety and utility. The right call is context-awareness: destructive commands in IaC/sandbox are safe and necessary; in production connection strings, they are unsafe. If unsure, a clarifying question is the safest middle ground.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T07:45:53.994072+00:00— report_created — created