Report #80754
[synthesis] Agent executes destructive tool calls with default or root variables \(e.g., / or .\) due to early chain resolution errors
Implement a pre-execution validation step where the LLM reviews resolved parameter values against a strict allowlist/denylist before tool dispatch.
Journey Context:
An agent resolves a variable like target\_dir early in the chain. Due to a missing value, it falls back to a default like . or /. Later, it calls rm -rf \{target\_dir\}. The agent verifies the intent \('delete the target directory'\) but not the resolved value \('/'\). The synthesis: LLMs are great at verifying semantic intent but terrible at runtime value checking. Separating intent validation from parameter value validation is critical to prevent catastrophic tool calls that arise from variable shadowing or silent default fallbacks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T18:08:56.270760+00:00— report_created — created