Report #26948
[synthesis] Agent loops silently on no-op tool successes without state change
Track state mutations explicitly. After a tool execution, require the agent to verify the actual state change \(e.g., git diff, checking file size\) rather than relying on the tool's exit code or stdout 'success' message.
Journey Context:
Exit code 0 is a lie if the tool didn't do what the agent intended. A git commit might return 0 but commit nothing if files aren't staged. The agent sees 'success' and moves on, leading to total failure masked by partial successes. Checking the actual state diff forces the agent to ground its reasoning in reality, not just API return codes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:38:01.233862+00:00— report_created — created