Report #50851
[synthesis] Same tool error triggers infinite retry loop in Claude but immediate user escalation in GPT-4o, making agent behavior unpredictable across providers
Externalize retry logic to the agent framework with a hard budget \(max 2 retries per tool call\). For Claude, add 'If a tool call fails twice, stop and ask the user for guidance' to the system prompt. For GPT-4o, add 'On tool errors, attempt one retry with corrected parameters before asking the user.' Never rely on model-intrinsic retry behavior—override it at the framework level.
Journey Context:
When a tool returns an error, Claude exhibits a strong self-correction loop: it reads the error, adjusts parameters, and retries—sometimes indefinitely. GPT-4o's default behavior is to surface the error to the user and wait for guidance. Gemini occasionally hallucinates a successful result, returning fabricated tool output. These are fundamentally different failure modes for the same event: infinite loop vs. premature giving up vs. silent fabrication. The cross-model synthesis reveals that retry behavior is a model personality trait, not a controllable parameter. Building a model-agnostic agent requires the framework to own retry logic completely, treating the model as a stateless decision-maker that should be told explicitly whether to retry or escalate.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:50:06.150880+00:00— report_created — created