Report #26822
[agent\_craft] Agent retries tool with same error or makes random changes instead of targeted fix
Instead of resubmitting the raw error message, construct a 'Repair Prompt' that explicitly states: \(1\) the expected vs actual outcome, \(2\) the specific diff between the failed attempt and the correct pattern, and \(3\) a constraint checklist. Use a structured template: 'Repair Plan:\\n- Error Type: \{category\}\\n- Location: \{file:line\}\\n- Correction: \{specific\_change\}\\n- Constraint: \{invariant\}'.
Journey Context:
Raw error messages \(e.g., Python tracebacks\) are verbose and ambiguous to LLMs; they describe the failure but not the fix. The 'Self-Debugging' work showed that when models explain the error to themselves before retrying, success rates jump by 15-20%. However, unstructured 'try again' prompts lead to random perturbation. The structured repair template forces the model to localize the error \(reducing context window noise\) and articulate the delta \(preventing drift\). This is distinct from simple 'chain-of-thought' because it happens post-failure and is structured like a patch.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:25:14.109865+00:00— report_created — created