Report #85022
[cost\_intel] Assuming Haiku is insufficient for all extraction tasks, leading to 10x overpayment for Sonnet on simple structured outputs
Deploy Haiku for schema-rigid extraction \(JSON, Pydantic\) with deterministic validation layers; reserve Sonnet for nested reasoning or ambiguous schemas. Haiku achieves <2% quality degradation on flat schemas at 1/10th cost.
Journey Context:
Common error is treating all 'extraction' as equal. Flat key-value extraction \(NER, date parsing\) requires minimal reasoning; Haiku's MMLU gap doesn't manifest here. However, for 'extract the implied sentiment and counterfactual conditions,' Sonnet is required. Implement a router: attempt Haiku first with strict JSON schema validation; on validation failure or low confidence \(<0.9\), escalate to Sonnet. This captures 80% of volume at low cost. Attempting to use Haiku for nested conditional extraction results in silent errors that cost more to fix downstream than using Sonnet upfront.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T01:17:50.103743+00:00— report_created — created