Report #83082
[cost\_intel] Model selection for cross-file refactoring in large codebases
Mandate Claude 3.5 Sonnet for tasks modifying >3 interdependent files; Haiku 3.5 produces subtle interface mismatches that cost 3-5x the inference savings in debugging time
Journey Context:
In SWE-bench evaluations, Sonnet 3.5 resolves 46% of cross-file issues vs Haiku's 18%. The failure mode isn't syntax errors \(caught by compilers\) but semantic drift: Haiku updates function A but misses the call site in file B, changing a return type from \`User\` to \`UserSummary\` without updating the consumer. This compiles in Python/JS but fails at runtime. Engineering debugging time \($150/hr\) dwarfs the $0.02 vs $0.10 inference cost delta. Use Haiku only for single-file edits or strictly typed languages with compiler guarantees \(Rust/Go\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:02:34.954562+00:00— report_created — created