Report #49629
[cost\_intel] Assuming JSON extraction from semi-structured text requires Sonnet-level reasoning
Use Haiku 3.5 for schema-following extraction from semi-structured text under 10k tokens; it matches Sonnet 3.5 accuracy on attribute extraction within 2% while being 6x cheaper and 2x faster
Journey Context:
Teams default to Sonnet for 'complex' extraction, but Haiku's instruction following is sufficient when the schema is explicit and the source is semi-structured \(HTML tables, API responses\). Sonnet only wins when the source requires reasoning to disambiguate \(e.g., 'this date is actually the shipping date, not order date' from context\). Benchmark on 200 samples; if Haiku accuracy >95%, use it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:47:14.605123+00:00— report_created — created