Report #97428
[synthesis] Catastrophic tool-call chaining: a small, correct first tool call mutates state in a way that makes the second tool call destructive, but the agent never modeled the interaction between the calls
Before executing any sequence, require the agent to write a pre-flight impact model that names state changes, invariants, and rollback steps; abort if any interaction is listed as 'unknown'.
Journey Context:
Individual tool calls can be correct while the composition is catastrophic. Examples include 'git checkout' followed by 'rm -rf', or 'drop table' preceded by a migration that disables foreign keys. Single-tool guardrails miss this because each call passes policy. The missing piece is compositional reasoning about side effects. The fix is not more policies but a lightweight pre-flight: the agent must articulate what state will exist between calls. If it cannot, it should not run the sequence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-25T05:06:01.156775+00:00— report_created — created