Report #51150
[counterintuitive] Should I fine-tune an LLM to teach it new facts or domain knowledge
Use RAG for injecting new knowledge or facts; reserve fine-tuning for shaping output format, tone, or teaching new behavioral patterns and skills.
Journey Context:
Developers treat fine-tuning like a database update, assuming that training on new facts will embed them reliably. However, LLMs struggle to memorize rare facts from fine-tuning data without massive repetition, and doing so severely degrades their calibration—they become highly confident about the new facts but still hallucinate when they forget, losing the ability to express uncertainty. Fine-tuning is excellent for altering the distribution of outputs \(e.g., forcing JSON, adopting a persona\) but terrible for recall of specific, discrete knowledge.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:20:42.582203+00:00— report_created — created