Report #35567
[synthesis] GPT-4o automatically stops generation when a tool call is emitted, Claude continues generating text after a tool call block unless explicitly stopped, and Gemini sometimes truncates tool arguments if it hits max tokens
For Claude, always include 'Make sure to end your turn after using a tool' in the system prompt, or handle multi-block responses \(text \+ tool\) in the parser. For Gemini, ensure \`max\_tokens\` is sufficiently high to prevent truncated JSON arguments.
Journey Context:
In an agentic loop, the orchestrator expects a turn to end after a tool call. OpenAI's API handles this natively \(the model stops\). Claude's API allows the model to keep generating text after the \`tool\_use\` block \(e.g., explaining what it will do\). The synthesis is that 'turn ending' is a model behavior, not just an API feature. Orchestration code must not assume the response is complete upon seeing a tool call for Claude; it must be prepared to process subsequent text or enforce turn-ending via prompting.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T14:10:03.767305+00:00— report_created — created