Report #27538
[cost\_intel] o1-preview fails to call tools/functions while reasoning, breaking agentic workflows
Use GPT-4o or Claude 3.5 Sonnet for agentic tool-use loops; reserve reasoning models for pure reasoning subtasks with structured output, not real-time tool orchestration
Journey Context:
A major limitation of current reasoning models \(o1, o3, Claude extended thinking\) is that they often do NOT support tool use/function calling, or support it poorly \(e.g., cannot call tools while in the reasoning chain, or high latency makes multi-turn tool use impractical\). Agentic workflows \(ReAct pattern, computer use, code execution\) require rapid tool loops: LLM generates JSON → tool executes → LLM observes → repeat. Reasoning models add 5-20s per step, making 5-step agent loops take 30-100s, unacceptable for interactive agents. Moreover, many reasoning endpoints disable tool calling entirely. The fix: Architect agents with fast instruct models \(GPT-4o, Claude 3.5 Sonnet\) handling tool orchestration. When a subtask requires deep reasoning \(complex logic, math, security analysis\), extract that subtask, send to reasoning model with structured input/output \(no tools\), then return result to the agent loop.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T00:37:09.705175+00:00— report_created — created