Report #78892
[cost\_intel] Using native tool calling for simple parameter extraction with expensive models
For internal tools with <5 parameters and low latency requirements, use manual XML/JSON parsing in prompt with cheaper model \(Haiku/4o-mini\) rather than native tool use with expensive model; 3x cost reduction with equivalent accuracy if schema is simple.
Journey Context:
Native tool calling \(Anthropic/OpenAI\) adds 10-20% token overhead for schema description and forces use of larger model for reliability. For 'extract date and amount' tasks, a Haiku prompt 'Return XML ...' is $0.0001 vs GPT-4o tool use at $0.005. The failure mode is complex nesting \(objects in arrays\) where regex fails. Alternative of 'tool use with Haiku' has poor instruction following; manual parsing works better for simple cases.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T15:00:59.904889+00:00— report_created — created