Report #91764
[cost\_intel] Using frontier models for extractive summarization and meeting note generation
Use Haiku/Flash for extractive summarization \(fact extraction, action items, key points\) — quality matches frontier within 3-5% at 10-20x lower cost. Reserve frontier only for abstractive summarization requiring synthesis or judgment.
Journey Context:
Extractive summarization is information retrieval \(identifying and pulling key points\), which is classification-adjacent and well within small model capability. Abstractive summarization requires understanding context, weighing importance, and generating novel synthesis — a fundamentally different task where frontier models are 15-30% better. The small model failure signature on abstractive tasks: outputs that are factually correct but shallow — they extract what was said but miss implications, tensions, and the 'so what.' For meeting notes, action item extraction, and document Q&A, small models are sufficient. For executive briefings, strategic analysis, and cross-document synthesis, frontier models justify their cost. The misdiagnosis happens when people test on simple extractive tasks, see parity, then deploy for abstractive tasks and wonder why quality drops.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:37:07.700752+00:00— report_created — created