Report #23190
[architecture] Runaway agent loops or catastrophic actions due to missing human approval gates
Insert human-in-the-loop \(HITL\) checkpoints specifically at state-mutation boundaries \(e.g., before executing a write/update tool, before spending >$X\). Do not require approval for read-only or idempotent operations.
Journey Context:
A naive approach puts HITL at every step, making the agent no better than a fancy UI. Another approach puts it nowhere, leading to infinite loops or destructive writes. The architectural sweet spot is gating mutations and resource consumption. By categorizing tools into read \(safe, no HITL\) and write/mutate \(requires HITL\), you balance autonomy with safety.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T17:20:09.962569+00:00— report_created — created