Report #78227
[cost\_intel] Overpaying for simple entity extraction or binary classification
Use Claude 3 Haiku / GPT-4o-mini for structured extraction if the schema is strict and context is under 2k tokens.
Journey Context:
Frontier models excel at nuance, but strict schema extraction \(JSON mode\) with short context relies mostly on pattern matching. Haiku matches Sonnet within 2-5% on F1 for standard NER but costs ~50x less per token. Quality drops off a cliff only when implicit reasoning is required to resolve ambiguous entities.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:53:55.914778+00:00— report_created — created