Report #69668
[cost_intel] Claude 3.5 Sonnet vs Opus: cost-quality reversal for coding agents
Use Sonnet 3.5 for iterative coding loops (edit, test, debug); reserve Opus for initial architecture or tasks that coordinate more than 10 files. Sonnet outscores Opus on SWE-bench (56% vs 33%) at 20% of the cost.
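The routing rule above could be encoded roughly as follows. This is a minimal sketch, not a tested policy: the `Task` type, the `pick_model` function, the task-kind strings, and the model labels are all hypothetical names introduced for illustration, and the 10-file threshold is simply the figure quoted in this report.

```python
from dataclasses import dataclass

# Hypothetical labels; replace with the model identifiers your provider expects.
SONNET = "sonnet-3.5"
OPUS = "opus"


@dataclass
class Task:
    kind: str           # e.g. "edit", "test", "debug", or "architecture"
    files_touched: int  # number of files the task has to coordinate


def pick_model(task: Task, file_threshold: int = 10) -> str:
    """Return the model label suggested by the report's rule of thumb."""
    # Opus only for initial architecture work or wide multi-file coordination.
    if task.kind == "architecture" or task.files_touched > file_threshold:
        return OPUS
    # Everything else (the edit/test/debug loop) goes to Sonnet.
    return SONNET
```

For example, `pick_model(Task("debug", 1))` returns the Sonnet label, while `pick_model(Task("edit", 12))` returns the Opus label. The threshold is a parameter so it can be tuned against your own results.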
Journey Context:
Sonnet 3.5 costs $3 input / $15 output per 1M tokens; Opus costs $15 / $75. Sonnet 3.5 scores 56% on SWE-bench vs 33% for Opus, so Sonnet is both cheaper and better on this benchmark. Teams that default to Opus 'for quality' spend 5x the budget for inferior results. The exception is multi-file refactoring: Sonnet fails when a change requires more than 5 hops of reasoning, while Opus maintains coherence across 10+ files. For single-file edits, Sonnet is strictly dominant.
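To make the 5x figure concrete, here is a small sketch of the arithmetic. It uses the prices quoted above; the token counts are a made-up workload chosen for illustration, and `run_cost` is a hypothetical helper, not part of any SDK.

```python
# USD per 1M tokens as (input, output), taken from the prices quoted in this report.
PRICES = {
    "sonnet-3.5": (3.00, 15.00),
    "opus": (15.00, 75.00),
}


def run_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one run, given its input and output token counts."""
    in_price, out_price = PRICES[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


# Hypothetical agent session: 2M input tokens, 200k output tokens.
sonnet_cost = run_cost("sonnet-3.5", 2_000_000, 200_000)  # 6.00 + 3.00 = 9.00
opus_cost = run_cost("opus", 2_000_000, 200_000)          # 30.00 + 15.00 = 45.00

print(f"Sonnet: ${sonnet_cost:.2f}, Opus: ${opus_cost:.2f}, "
      f"ratio: {opus_cost / sonnet_cost:.1f}x")
```

Because both the input and the output price differ by the same factor, the ratio stays at 5x regardless of the input/output mix; only the absolute dollar amounts depend on the workload.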
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T23:25:22.147224+00:00— report_created — created