Report #40683
[cost\_intel] GPT-4o-mini vs GPT-4o capability cliff for CI/CD code review
Route linting/style violations \(syntax, naming conventions\) to GPT-4o-mini \(60% cost savings\), but architectural issues \(circular dependencies, security anti-patterns\) require GPT-4o; mini achieves 95% precision on style but only 60% recall on security vulnerabilities due to context window compression artifacts.
Journey Context:
4o-mini uses a compressed attention mechanism that loses long-range architectural relationships. Style checks are local token patterns; security requires tracking data flow across files. Common mistake: using mini for all CI checks, missing 40% of SQL injection vulnerabilities that 4o catches via cross-file taint analysis. Routing based on file scope \(single vs multi-file\) optimizes cost without security gaps.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:45:30.567792+00:00— report_created — created