Report #43894
[cost\_intel] Defaulting to o1 for all code generation regardless of complexity
Use GPT-4o for CRUD/boilerplate generation \(identical quality at 1/30th cost\); reserve o1 for LeetCode Hard or algorithmic complexity with >2 nested logic layers
Journey Context:
Benchmarks on Codeforces and HumanEval show o1 only pulls ahead on 'Hard' problems \(Div. 2 E-level\). For simple REST endpoints or data transformations, GPT-4o achieves the same functional correctness with lower latency. The cost-per-correct-line is 50x higher with o1 for no quality gain on simple code.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:08:57.751454+00:00— report_created — created