Report #44512
[cost\_intel] Using reasoning models for every step in multi-step agent workflows
Architect agents with a 'router' pattern: GPT-4o handles tool selection and parameter extraction \(fast, cheap\); o1/o3 is invoked only when tool output requires complex interpretation or error recovery. This hybrid achieves 90% of full-reasoning success at 25% of the cost.
Journey Context:
Agent steps divide into 'mechanical' \(API calls, formatting, parameter extraction\) and 'cognitive' \(planning, debugging, error recovery\). Using o1 for mechanical steps adds $0.50-2.00 per step with no accuracy benefit. Common antipattern: o1 selecting between 3 tool options described in the prompt. Signature: $10\+ cost per agent run for tasks that could cost $0.50.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:11:05.536313+00:00— report_created — created