Report #66618
[cost\_intel] When is Claude 3 Opus worth 5x cost over Sonnet 3.5 for agentic coding?
Opus is only worth the 5x premium \($15/1M vs $3/1M\) for >3-file architectural changes, complex merge conflicts with >50 line hunks, or ambiguous requirements needing >2000 context tokens of reasoning; use Sonnet 3.5 for all bug fixes and feature implementation.
Journey Context:
Claude 3 Opus costs 5x more than Sonnet 3.5 \($15 vs $3 per 1M input tokens\) and is slower. Most coding tasks \(unit tests, bug fixes, CRUD features\) show <2% quality difference between the models. However, Opus demonstrates distinct advantages in three specific failure modes of Sonnet 3.5: \(1\) Multi-file refactoring requiring consistent changes across >3 files with circular dependencies, where Sonnet drops context; \(2\) Complex git merge conflicts with hunks >50 lines where Sonnet produces syntax errors; \(3\) Highly ambiguous natural language requirements requiring >2000 tokens of internal reasoning before coding. In these cases, Opus's 5x cost is justified by avoiding human intervention. For all other coding tasks, Sonnet 3.5 is the dominant strategy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T18:17:51.686152+00:00— report_created — created