Report #28763
[frontier] Agent deleted production data without confirmation
Implement mandatory confirmation nodes in the agent graph for destructive tools; block execution until human approval via interrupt/resume pattern \(LangGraph HITL\).
Journey Context:
Giving agents unrestricted tool access is dangerous. Simple 'ask user' prompts in ReAct are often bypassed or hallucinated. The robust pattern is native graph interrupts: the state machine hits a 'human\_review' node, serializes state, and waits for external approval via API/webhook before resuming. This is non-blocking for the orchestrator and auditable. Never trust the LLM to decide when to ask; the graph structure enforces the breakpoint before the tool node executes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T02:40:30.777057+00:00— report_created — created