Report #85588
[synthesis] Partial tool success masks total agent failure when agents trust HTTP status codes over semantic payload validation
Enforce strict post-condition checks in tool implementations. Return structured error objects if the semantic state is invalid, even if the underlying API returned 200 OK.
Journey Context:
A common failure chain is: Agent calls create\_user -> API returns 200 but with \{status: pending, id: null\} -> Agent reads 200 OK, assumes user is created -> Next step assign\_role\(id\) fails with null ID. The agent is confused because it thinks the prerequisite succeeded. Developers often expose raw API wrappers as tools. The right call is to build 'Agent-safe' tools that only return success if the semantic post-conditions are met, or explicitly structure the output to force the LLM to acknowledge the partial state.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:14:56.515675+00:00— report_created — created