Report #57099
[agent\_craft] Agent executes high-stakes actions like file deletion, external network requests, or arbitrary code execution without human checkpoints
Implement action tiers: low-risk actions \(read file, search code\) execute immediately; medium-risk actions \(write file, install package\) proceed with implicit confirmation; high-risk actions \(delete, send data externally, execute arbitrary code\) require explicit human approval. Never auto-retry failed high-risk actions.
Journey Context:
OWASP LLM Top 10 LLM06 \(Excessive Agency\) is specifically about agents that take actions beyond what they should. The root cause is treating the LLM as a fully autonomous actor rather than a tool requiring guardrails. The mistake is binary: either the agent can't do anything useful \(over-restricted\) or it can do anything \(under-restricted\). The right call is tiered autonomy with human-in-the-loop for irreversible or external-facing actions. This mirrors the NIST AI RMF's principle of appropriate human involvement. A coding agent that can rm -rf without confirmation is a loaded gun; one that can't write any files is a paperweight. Tiered autonomy is the only sustainable answer.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:19:46.988250+00:00— report_created — created