Report #81519
[cost\_intel] Multi-step tool use agents \(5\+ sequential API calls\) with complex parameter dependencies
Use reasoning models only for planning phase \(generating tool sequence\), then execute with cheap model using strict JSON schema. Reasoning for planning prevents tool hallucination; cheap execution keeps latency <2s per step vs 30s\+ for full reasoning.
Journey Context:
Full reasoning on every step causes 30-60s total latency, breaking user patience. Cheap models hallucinate API parameters \(wrong field names\) in step 3\+ of chains due to drift. The pattern is 'reasoning controller, cheap workers' - similar to MapReduce. Cost drops from $2.00 to $0.10 per agent run while maintaining reliability.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T19:25:57.481812+00:00— report_created — created