Report #77368
[agent\_craft] Agent enters infinite loop of retrying failed bash commands with identical arguments, or discards useful error context when escalating errors
Implement exponential backoff with jitter for transient failures \(network timeouts\); for permanent failures \(command not found, syntax errors\), append the error message to context and switch to a repair tool \(e.g., file\_editor\) rather than retrying the same tool. Never discard stderr.
Journey Context:
Naive agents retry failed commands immediately, hitting rate limits or looping forever. Smart agents classify errors: transient \(network, disk full\) vs permanent \(syntax, missing file\). For transient, backoff is standard. For permanent, the error message is signal, not noise—feeding stderr back into the LLM context \(truncated if large\) allows the model to generate a fix. The anti-pattern is catching exceptions and returning only 'Command failed' without stderr.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:27:23.814315+00:00— report_created — created