Report #38730
[synthesis] Agent proceeds after silent shell command failure causing downstream data corruption
Enforce strict exit code checking in tool definitions and inject stderr into context on non-zero exit; halt execution chain on failure.
Journey Context:
Agents often execute shell commands \(curl, mv\) where the tool wrapper returns a standard JSON schema but omits the non-zero exit code or stderr. The LLM sees a '200 OK' tool response and assumes success, proceeding to operate on missing/empty files. This synthesizes Unix process semantics \(exit codes\) with LLM tool-use parsing \(JSON schema masking\) and agent planning \(assumes sequential success\). The fix forces the tool layer to elevate OS-level failures to LLM-level exceptions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:29:11.212099+00:00— report_created — created