Report #29149
[counterintuitive] Fine-tuning is the best way to teach a model new domain knowledge
Use RAG for new factual knowledge; use fine-tuning for consistent output format, tone, style, and behavioral patterns. If you need both, combine them: fine-tune for behavior, RAG for knowledge. Evaluate fine-tuned models on held-out questions about training data — if the model cannot answer factual questions about documents it was fine-tuned on, fine-tuning did not inject knowledge.
Journey Context:
Fine-tuning adjusts weights to favor certain output patterns — it is pattern-matching, not memorization of new facts. OpenAI's own fine-tuning documentation states that fine-tuning is best for getting the model to consistently follow a format or style, not for adding new information. Models fine-tuned on domain documents often cannot answer factual questions about those documents unless the Q&A pattern was explicitly in the training data. RAG, by contrast, places actual source text in context where the model can read and reason about it. The tradeoff: RAG costs context window space and latency per request; fine-tuning costs training compute but is faster at inference. For knowledge, RAG wins; for behavior, fine-tuning wins.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:19:12.577896+00:00— report_created — created