Report #56181
[architecture] Autonomous multi-agent loops execute irreversible actions without approval, leading to unrecoverable errors
Implement a break-before-make interrupt pattern in the orchestrator state machine, pausing execution and yielding to human approval before executing tools marked as destructive or irreversible.
Journey Context:
Developers often implement human-in-the-loop (HITL) approval by prompting the LLM with a question such as 'Should I ask the user?'. The LLM can skip that step or hallucinate an approval that was never given. HITL must therefore be an orchestration-level constraint, not an LLM decision: when a tool marked as destructive or irreversible is requested, the state machine transitions to an interrupted state before the tool runs, yields to the user, and resumes only upon explicit external approval. Tradeoff: this breaks fully autonomous flow and increases task completion time, but it acts as a necessary circuit breaker for high-stakes operations.
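The interrupt pattern described above can be sketched as a small state machine. This is a minimal illustration, not a reference implementation: the names `Orchestrator`, `State`, `PendingCall`, `request`, and `resume` are hypothetical, and a real orchestrator would also need to persist the pending call so that approval can arrive in a later process or session.

```python
from dataclasses import dataclass, field
from enum import Enum, auto
from typing import Callable, Dict, List, Optional, Set


class State(Enum):
    RUNNING = auto()
    INTERRUPTED = auto()  # a destructive call is parked, awaiting a human


@dataclass
class PendingCall:
    tool: str
    arg: str


@dataclass
class Orchestrator:
    """Hypothetical orchestrator; the LLM never decides whether to ask."""
    tools: Dict[str, Callable[[str], str]]
    destructive: Set[str]  # tool names marked destructive or irreversible
    state: State = State.RUNNING
    pending: Optional[PendingCall] = None
    log: List[str] = field(default_factory=list)

    def request(self, tool: str, arg: str) -> Optional[str]:
        """Entry point for every tool call the agent proposes."""
        if self.state is State.INTERRUPTED:
            raise RuntimeError("awaiting approval; no new tool calls accepted")
        if tool in self.destructive:
            # Break before make: park the call and stop. Nothing has run yet.
            self.pending = PendingCall(tool, arg)
            self.state = State.INTERRUPTED
            return None
        return self._run(tool, arg)

    def resume(self, approved: bool) -> Optional[str]:
        """Called by the human-facing layer only, never by the agent."""
        if self.state is not State.INTERRUPTED or self.pending is None:
            raise RuntimeError("nothing to resume")
        call, self.pending = self.pending, None
        self.state = State.RUNNING
        if not approved:
            self.log.append(f"rejected {call.tool}({call.arg})")
            return None
        return self._run(call.tool, call.arg)

    def _run(self, tool: str, arg: str) -> str:
        result = self.tools[tool](arg)
        self.log.append(f"ran {tool}({arg})")
        return result
```

The gate sits in `request`, so the check cannot be bypassed by anything the model outputs: a destructive tool is reachable only through `resume(approved=True)`, and that method is assumed to be exposed to the approval interface rather than to the agent loop.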
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T00:47:35.747843+00:00— report_created — created