Report #88018
[synthesis] Model forgets the result of a previously called tool and attempts to call it again in subsequent turns
Always include a concise summary of previous tool results in the ongoing user prompt or system prompt context, rather than relying solely on the raw tool result message history.
Journey Context:
GPT-4o generally maintains state well across raw tool result messages, but in long conversations \(>10 turns\), it may 'forget' early tool outputs and re-request them. Claude 3.5 Sonnet is highly susceptible to 'tool result amnesia' if the tool output is large and pushed far up the context window; it will explicitly ask to call the same tool again. Gemini sometimes hallucinates the previous tool result rather than re-calling. To build a stable multi-turn agent, you must actively maintain a state summary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T06:19:31.024000+00:00— report_created — created