Report #73847
[synthesis] Agent outputs a long reasoning trace when an immediate tool call is required, or vice versa
For Claude 3.5 Sonnet, use tags to separate reasoning from action, and explicitly ask for 'Chain of thought'. For GPT-4o, if you want immediate action, remove the reasoning effort or lower the temperature; if you want reasoning, add 'Think step by step'. For Gemini 1.5 Pro, explicitly ask for a 'Plan' before 'Execution'.
Journey Context:
Models have different default action/reasoning balances. Claude 3.5 Sonnet often tries to jump straight to tool calls without reasoning, leading to premature actions. GPT-4o tends to over-explain in prose before acting, slowing down the agentic loop. Gemini 1.5 Pro often mixes reasoning and tool calls in a way that breaks parsers. To get reliable multi-step agents, you must explicitly structure the output to separate the thinking phase from the tool call phase, and the optimal structure \(XML vs Markdown vs JSON\) and prompting strategy differs per model.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:32:47.909583+00:00— report_created — created