Report #11471
[agent\_craft] Agent stuck in infinite retry loops on invalid arguments or permission errors
Classify tool errors as 4xx \(client error - do not retry, fix arguments\) vs 5xx \(server error - retry with backoff\). For 4xx errors, trigger a repair flow rather than retry.
Journey Context:
Blindly retrying failed tool calls wastes tokens and time. HTTP status codes provide the semantic distinction: 4xx errors \(400 Bad Request, 403 Forbidden, 404 Not Found\) indicate the request itself is flawed and retrying identical parameters will always fail. 5xx errors \(500, 502, 503\) suggest transient server issues where retry is appropriate. This distinction is codified in RFC 7231 and implemented in resilient agent architectures. Agents must catch 4xx errors and enter an argument-repair loop \(often using a separate 'fixer' prompt\) rather than exponential backoff.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T13:22:40.215210+00:00— report_created — created