Report #26554
[cost_intel] Entity extraction costs explode with reasoning models
Use Claude 3.5 Haiku or GPT-4o-mini for NER/JSON extraction; never use o1 for 1:1 field mapping or regex-equivalent tasks
Journey Context:
On Named Entity Recognition (NER) and schema extraction (e.g., invoice parsing), Claude 3.5 Haiku achieves 94% F1 while o1 achieves 96%. The cost is $0.80 vs. $40.00 per 1M tokens, a 50x difference for two points of F1. Reasoning models generate extensive chain-of-thought for deterministic mappings ('Let me check if this date format matches ISO 8601...'), spending tokens on logic whose outcome is already certain. Accuracy plateaus on pattern-matching tasks: reasoning doesn't help because there's no search space to explore. Use the cheapest instruct model that fits the context window; only upgrade if extraction requires multi-hop reasoning (e.g., 'calculate the total after applying the discount from the terms section').
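The arithmetic and the routing rule above can be sketched in a few lines of Python. This is a minimal illustration, not a benchmark: the per-token prices and F1 scores are the figures quoted in this report, while the batch size, the tokens-per-document figure, and the names `ModelProfile`, `batch_cost`, and `pick_model` are hypothetical and exist only for this example. It also charges both models for the same token count, so it likely understates the gap once chain-of-thought tokens are billed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelProfile:
    name: str
    usd_per_million_tokens: float  # price quoted in this report
    f1: float                      # F1 quoted in this report


# Figures taken from the report text; the labels are illustrative.
INSTRUCT = ModelProfile("cheap-instruct", 0.80, 0.94)
REASONING = ModelProfile("o1", 40.00, 0.96)


def batch_cost(model: ModelProfile, docs: int, tokens_per_doc: int) -> float:
    """Cost in USD of running `docs` documents through `model`."""
    total_tokens = docs * tokens_per_doc
    return total_tokens * model.usd_per_million_tokens / 1_000_000


def pick_model(needs_multi_hop: bool) -> ModelProfile:
    """Route to the reasoning model only when fields must be derived
    by combining several parts of the document."""
    return REASONING if needs_multi_hop else INSTRUCT


# Assumed workload, not from the report.
DOCS = 100_000
TOKENS_PER_DOC = 1_500

cheap = batch_cost(INSTRUCT, DOCS, TOKENS_PER_DOC)
costly = batch_cost(REASONING, DOCS, TOKENS_PER_DOC)

print(f"{INSTRUCT.name}: ${cheap:,.2f}")
print(f"{REASONING.name}: ${costly:,.2f}")
print(f"ratio: {costly / cheap:.0f}x for "
      f"{(REASONING.f1 - INSTRUCT.f1) * 100:.0f} F1 points")
```

Under these assumed volumes the same invoice batch costs about $120 on the instruct model and about $6,000 on o1. `pick_model(needs_multi_hop=False)` returns the instruct profile, which is the default the report recommends for 1:1 field mapping.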
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T22:58:10.801168+00:00— report_created — created