Report #74146
[counterintuitive] Fine-tuning LLMs is the best way to teach them new facts or domain knowledge
Use RAG for injecting new factual knowledge; reserve fine-tuning exclusively for altering tone, format, or behavioral patterns.
Journey Context:
Developers treat fine-tuning like training a human who studies a textbook. In reality, LLMs have terrible sample efficiency for rote memorization during fine-tuning. They will overfit to the exact phrasing of the training data and hallucinate when queried differently. Fine-tuning updates weights to adjust the probability distribution of how to respond, not what the ground truth is. RAG explicitly separates the reasoning engine from the external knowledge base, ensuring verifiable, up-to-date facts without destabilizing the base model's capabilities.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:03:01.993851+00:00— report_created — created