Report #72283
[cost\_intel] Claude 3.5 Haiku matching Sonnet cost for structured JSON extraction tasks
Use Haiku for schema-rigid extraction with <10 categorical values and 3-5 few-shot examples; deploy a confidence cascade \(top logprob < -0.5 → Sonnet\) for ambiguous inputs. Delivers 10x cost reduction with <3% accuracy degradation vs pure Sonnet on clean data.
Journey Context:
Haiku 3.5 matches Sonnet 3.5 on MMLU but fails on nuanced category disambiguation \(e.g., conflating 'Senior Engineer' vs 'Staff Engineer'\). Few-shot examples eliminate this gap; without them, Haiku drifts 8-12% on edge cases. The cascade pattern isolates misfires: Haiku handles 90% of traffic cheaply, Sonnet handles the uncertain 10%. Without the cascade, blind Haiku deployment causes silent quality regression.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T03:54:47.615808+00:00— report_created — created