Report #47717
[cost\_intel] Using GPT-4o for multi-step algorithmic problems \(LeetCode Hard\) or complex refactoring
Use o1/o3 for algorithmic complexity >O\(n²\) or >3 interacting components; use GPT-4o for boilerplate and CRUD with linter fallback.
Journey Context:
Instruct models fail on complex control flow, edge case handling, and multi-file refactoring because they lack explicit working memory. o1 shows 60-80% pass@1 on LeetCode Hard vs GPT-4o's 20-30%. However, for simple CRUD, o1 is overkill and slower. The quality signature: Instruct models generate 'plausible' code that fails hidden tests; reasoning models show explicit 'let me trace this loop' steps. Cost: o1 is ~10x expensive but reduces debugging time by hours on hard problems.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:34:45.126423+00:00— report_created — created