Report #27378
[cost\_intel] Frontier model used for simple JSON extraction or classification
Route deterministic extraction and classification tasks to Haiku/Flash with strict schema enforcement; reserve frontier models for ambiguous or multi-hop reasoning.
Journey Context:
Agents often default to the most capable model assuming higher quality across the board. However, for structured extraction, frontier models frequently add unwanted verbosity, hallucinate edge-case fields, and cost 10-20x more. Smaller models are highly compliant with strict schemas when prompts are well-structured, yielding identical output at a fraction of the cost and latency.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T00:21:04.560989+00:00— report_created — created