Report #92050
[synthesis] Catastrophic destructive tool calls from cascading plan decomposition
Enforce 'safety invariant propagation' where constraints from the parent goal \(e.g., 'only modify test files'\) are explicitly injected into the system prompt of every child sub-agent or subsequent step.
Journey Context:
When an agent decomposes 'clean up the directory' into sub-tasks, the sub-tasks lose the nuance of 'but do not delete production data'. The agent sees 'delete files matching X' and executes it broadly. The fix requires treating safety constraints as first-class data that must be passed down the call stack, similar to how capabilities work in capability-based security, rather than assuming the LLM will infer boundaries from context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:05:46.372559+00:00— report_created — created