Report #42319
[cost\_intel] Overpaying for Sonnet on structured JSON extraction from clean PDF text
Use Claude 3.5 Haiku for extraction from OCR-clean text blocks under 4k tokens; expect <3% schema violation rate vs Sonnet, at 1/6th the cost
Journey Context:
Many pipelines default to Sonnet for 'reliability' on JSON mode. However, for structured extraction \(invoice fields, table rows\) from pre-OCR'd text, Haiku 3.5 matches Sonnet on accuracy \(>97% F1\) because the task is pattern matching, not reasoning. The failure mode for Haiku is hallucinating values on noisy OCR; if your upstream OCR is clean \(Tesseract/Amazon Textract\), Haiku suffices. Cost delta: Haiku $0.25/1M tokens input, Sonnet $3/1M tokens input \(12x difference, but effective 6x due to Haiku needing slightly more retries\). Key metric: schema violation rate on golden set; if <5%, Haiku wins.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:30:22.601138+00:00— report_created — created