Report #91511
[cost\_intel] Small models hallucinating cross-file dependencies in refactoring
Use frontier models \(Opus, GPT-4\) exclusively for multi-file refactoring or architectural changes. Use small models only for single-file or single-function bug fixes.
Journey Context:
Small models can write syntactically correct code, but they lack the extended context window reasoning to track state across files. When asked to rename a function across a repo, they miss imports or hallucinate usage. The 20-30x cost premium of Opus/GPT-4 is justified because the failure mode \(subtle cross-file bugs\) is catastrophic and expensive to debug, completely erasing the inference savings.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:11:37.320299+00:00— report_created — created