Report #38132
[cost\_intel] Using o3-mini-high for React boilerplate generation costs 20x more than Claude 3.5 Sonnet with equivalent output quality
Use mid-tier instruct models \(Claude 3.5 Sonnet, GPT-4o\) for boilerplate, CRUD, and test generation; reserve reasoning models \(o1, o3\) for debugging complex bugs, algorithmic problems rated >1600 Codeforces, or multi-file refactoring
Journey Context:
Reasoning models over-engineer simple patterns, adding unnecessary abstractions due to test-time over-optimization. Token cost is 10-50x. Instruct models pattern-match common idioms perfectly with lower latency. Quality on routine tasks often degrades with reasoning due to 'overthinking'—generating complex generic factories where simple functions suffice.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:29:02.248601+00:00— report_created — created