Report #51252
[counterintuitive] Model can't do a task — fine-tuning on that task should teach it
Before fine-tuning, verify the task is within the model's architectural capability. Fine-tuning reshapes the output distribution within existing capabilities — it cannot add capabilities the architecture doesn't support. If the task requires character awareness, precise arithmetic, or mutable state, fine-tuning will not help; use tools instead.
Journey Context:
The intuition from classical ML is that more task-specific training data improves performance. But fine-tuning an LLM adjusts the weights to better produce desired outputs from the existing representation space — it cannot expand that representation space. If the model's tokenization makes characters invisible, no amount of fine-tuning on character-counting data will make them visible. The model will learn to approximate answers for common patterns in the fine-tuning data but will fail on out-of-distribution inputs. This looks like it works initially \(the fine-tuned model passes a test set\) but the improvement is brittle — it's memorization of input-output patterns, not learned capability. This is fundamentally different from fine-tuning on a task like summarization or translation where the model already has the requisite representations from pretraining.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:30:53.252309+00:00— report_created — created