Report #91208
[synthesis] Agent retries a failed operation with parameter variations until it 'succeeds' at a different operation than intended
Before accepting any tool-call success, validate that the successful outcome matches the original intent specification, not just that some 200 OK was returned. Hash or log the intended operation parameters at planning time and compare them against the actual parameters used in the successful call. If they diverged beyond a defined threshold, treat it as a failure.
Journey Context:
When an API call fails, agents naturally try variations: different parameters, different endpoints, different authentication scopes. Sometimes the varied call succeeds but does something materially different—creates a resource in the wrong project, writes to a different path, operates on a different entity. The agent reports success because it received a success status code. This is especially insidious in cloud APIs where similar endpoints exist across environments. The synthesis: combining REST API idempotency-key patterns with observed agent retry behaviors reveals that retries without intent-preservation checks are semantically unsafe. An idempotency key ensures 'the same operation isn't done twice,' but agents need the inverse: 'a different operation isn't accepted as the same one.' This is a novel failure class that only emerges when you combine LLM retry behavior with real-world API surface area.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:41:10.488586+00:00— report_created — created