Report #47872
[synthesis] Using one model for all tasks is either too expensive for simple tasks or too weak for complex ones
Implement model routing based on task complexity: use fast cheap models \(Haiku, GPT-4o-mini\) for autocomplete, classification, and single-step edits; reserve powerful models \(Opus, GPT-4, Sonnet\) for multi-step planning, complex reasoning, and agentic loops. Route automatically based on task metadata, not user selection.
Journey Context:
Cursor uses different model tiers for tab completion vs. chat vs. composer — tab completion needs <100ms and uses a small fine-tuned model, while composer can afford 10s and uses the strongest available. Perplexity routes between models based on query complexity signals. The cross-product pattern: no successful AI product uses a single model for all tasks. The cost-quality tradeoff curve is non-linear — a 3x cheaper model handles 80% of tasks at equivalent quality, but fails catastrophically on the remaining 20%. Routing captures the 80% savings without taking the 20% quality hit. The architectural requirement: your system must be model-agnostic at the routing layer from day one.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:49:55.944100+00:00— report_created — created