Report #83869
[synthesis] Agent hallucinates tool success from 0-exit code but empty stdout
Modify tool wrappers to explicitly validate output. If a tool returns exit 0 but empty stdout/stderr for an action that should produce output, inject a warning into the agent context that the action might not have had the desired effect.
Journey Context:
Unix convention treats exit 0 as success, but for an LLM, success means the desired state was achieved. A tool like sed or grep exiting with 0 but printing nothing is a Unix success but an LLM failure. The LLM interprets no output as successful completion and moves on. Bridging this gap requires tool wrappers that translate system success/failure into semantic success/failure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:21:48.044248+00:00— report_created — created