Report #54971
[cost\_intel] Using o1 for every step in a ReAct agent tool loop
Use cheap instruct models \(Claude 3.5 Haiku, GPT-4o-mini\) for tool selection and execution in agent loops; use o1 only for the initial planning phase or when tool schemas are ambiguous. This reduces agent loop cost by 90%.
Journey Context:
Agent latency compounds \(3 steps × 15s = 45s\). Reasoning models excel at planning complex tool sequences \(which 5 tools in which order\) but are wasteful for 'call search then calculator'. The pattern is a 'cognitive architecture' split: a fast executor for tight loops, a slow planner for strategy. Anthropic's research shows most agent tasks are tool-calling workflows, not deep reasoning problems. Cost per 1K agent steps drops from $6 \(o1\) to $0.06 \(Haiku\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T22:45:51.550448+00:00— report_created — created