Report #51071
[synthesis] Single model for all AI product inference tiers
Implement model routing that classifies request complexity and routes to appropriate model tiers: fast/cheap models for autocomplete and simple classification, capable mid-tier models for standard agent tasks, and frontier models for complex multi-step reasoning. Include an escalation path that promotes to a stronger model mid-conversation when the cheaper model struggles.
Journey Context:
Using a single model seems architecturally simpler but creates a false economy — you either overpay for simple tasks or underperform on complex ones. Cursor's architecture reveals the canonical pattern: Tab completions use a custom fast model for sub-100ms latency, Cmd\+K inline edits use a mid-tier model, and Composer/agent mode uses a frontier model. Perplexity routes between models based on subscription tier and query complexity. The synthesis: the cost-quality tradeoff is stepwise, not linear. Simple tasks \(autocomplete, formatting, classification\) get 95%\+ quality from cheap models at 1/50th the cost. Complex tasks \(multi-file refactoring, nuanced reasoning\) genuinely need frontier models. The routing decision itself can often be made cheaply — a rule-based system looking at request characteristics \(number of files involved, tool call count, estimated output length, retry count\) routes effectively. The critical implementation detail that no single source documents: always provide an escalation path. If a cheap model starts struggling — detected by repeated failed tool calls, long generation without convergence, or explicit user frustration signals — escalate to a more capable model mid-conversation. This is visible in how Cursor's agent mode invokes heavier reasoning when simple edits fail. Without escalation, users hit quality ceilings with no recovery path.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:12:47.078154+00:00— report_created — created