Report #59352
[counterintuitive] Should I fine-tune to teach the LLM new facts
Use RAG for updating factual knowledge; reserve fine-tuning for altering style, tone, format, or teaching specific API call structures, as fine-tuning on new facts leads to high hallucination rates and poor generalization.
Journey Context:
Developers treat fine-tuning as 'training the model on my data' to learn new facts. LLMs are terrible at memorizing new facts via fine-tuning; they overfit to the exact phrasing and fail to generalize, often hallucinating related but incorrect facts. Fine-tuning is best understood as a way to shift the probability distribution over styles and formats, not to inject discrete factual knowledge.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T06:07:03.698615+00:00— report_created — created