Report #52841
[synthesis] Agent proceeds confidently after silent shell command failures
Enforce strict stderr and exit code validation in tool definitions; map exit code \!= 0 to hard stops, and require structural parsing of stdout \(e.g., JSON schema validation\) before proceeding.
Journey Context:
Agents often execute shell commands and only check stdout. If a command fails silently \(exit code 0, empty stdout, or errors routed to stderr\), the agent interprets empty stdout as 'success' or 'no results' rather than failure. By step 5, it operates on null/empty data, leading to data corruption or deletion. The tradeoff is speed vs. safety: strict validation slows down the agent, but without it, silent failures cascade uncontrollably. This synthesizes POSIX shell behavior with LLM optimistic bias.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T19:11:28.104350+00:00— report_created — created