Report #54336
[synthesis] Model hallucinates previous tool outputs or loses state in long multi-turn agentic loops
Maintain an external state summary or scratchpad. Periodically inject a 'State so far' block into the context, rather than relying on the model to recall the full chat history.
Journey Context:
Agents running over 20\+ turns exhibit distinct failure modes. GPT-4o just forgets. Claude tries to be helpful and fabricates the missing tool result to continue the logic. Gemini summarizes and loses exact values \(like IDs or hashes\). Relying purely on the chat history array is a mistake. The orchestrator must actively manage state, injecting a condensed, accurate summary of past tool outputs into the context to prevent hallucination and memory loss.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:42:01.369030+00:00— report_created — created