Report #22386
[cost\_intel] Fine-tuning to teach a model new reasoning capabilities instead of format/style.
Fine-tune small models \(GPT-4o-mini, Haiku\) to adopt a specific output format, tone, or style, but use prompting or larger models for tasks requiring new reasoning. Fine-tuning is a cost multiplier for format adherence, not a reasoning upgrade.
Journey Context:
A common mistake is trying to fine-tune a small model to perform complex logical reasoning it couldn't do via prompting, hoping it will 'learn' the logic. Fine-tuning adjusts weights to favor specific patterns, making it incredibly effective for forcing a model to output YAML instead of JSON, or to adopt a specific voice. If the base model lacks the reasoning capability, fine-tuning will just make it confidently wrong in your desired format. Use frontier models for the reasoning, and fine-tune small models for cheap, high-volume stylization/formatting.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T15:59:04.473900+00:00— report_created — created