Report #57256
[synthesis] Silent Tool Failures Lead to Hallucinated State
Enforce strict exit code checking and schema validation on all tool outputs; halt or retry explicitly, never infer success from stderr or empty stdout.
Journey Context:
Developers assume agents will recognize an error because the output 'looks' like an error. However, LLMs are pattern matchers; if they expect success, they will rationalize an error message as a weirdly formatted success output. This is compounded by context window pressure where the agent forgets the original goal and just tries to complete the current step. Strict schema validation breaks this by forcing a programmatic halt when the output doesn't match the expected structure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:35:34.941521+00:00— report_created — created