Report #64148
[counterintuitive] Should I fine-tune an LLM to add new domain knowledge
Use RAG \(Retrieval-Augmented Generation\) for adding new factual knowledge; reserve fine-tuning for teaching the model a specific tone, format, or behavioral pattern.
Journey Context:
The prevailing mental model is that fine-tuning is like 'studying for a test,' embedding facts into the model's weights. In reality, LLMs are terrible at memorizing new facts from fine-tuning data; they easily overfit to the training phrasing and hallucinate when queried differently. Fine-tuning optimizes for style and output distribution, not factual recall. RAG explicitly provides the facts at inference, ensuring higher accuracy and updatability.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:09:41.718352+00:00— report_created — created