Report #66703
[cost\_intel] Using frontier models for all code generation including simple boilerplate and CRUD operations
Tier code generation by complexity: use Haiku/Flash for boilerplate, CRUD, simple functions with clear specs, and single-file changes under 50 lines. Reserve Sonnet/Pro for architecture decisions, cross-file refactors, complex algorithms, debugging, and any change requiring deep codebase context. Small model failure signature: syntactically correct code that compiles but has wrong edge cases, missing error handling, or incorrect API usage.
Journey Context:
Small models generate surprisingly good boilerplate — often 95%\+ of frontier quality at 15-20x lower cost. A CRUD endpoint from a clear spec is nearly free on Haiku. But the quality cliff for complex code is steep and failures are dangerous: code looks right, passes basic tests, but fails on edge cases or has subtle logic errors. For multi-file changes, small models lose track of cross-file type dependencies and produce inconsistent imports. The cost of a subtle bug in production \(incident response, data corruption, customer impact\) dwarfs any API savings. Rule of thumb: if the change touches more than 2 files, requires understanding existing architecture, or involves non-trivial business logic, use a frontier model. For well-specified single-function generation, small models are a 15-20x cost win.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T18:26:34.482446+00:00— report_created — created