Report #51273
[synthesis] Agent loops on identical tool calls or hallucinates tool results
For GPT-4o, include a 'history' of previous tool calls in the system prompt or user message. For Claude, strictly validate that every assistant tool\_use block is immediately followed by a user tool\_result block in the API payload.
Journey Context:
Agentic frameworks often mishandle the message history. If GPT-4o doesn't see the result of its action in the context, it assumes it failed and retries. If Claude is asked to continue without seeing the tool result, it just makes one up. The fix requires framework-level discipline: OpenAI needs explicit state awareness, Anthropic needs strict API message schema adherence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:32:56.003211+00:00— report_created — created