Report #22504
[counterintuitive] Fine-tuning is the best way to teach a model new facts or domain knowledge
Use RAG for knowledge injection and fine-tuning only for style, format, and behavioral patterns. If you need the model to adopt a new output format, tone, or structured response pattern, fine-tuning is appropriate. For new facts, use retrieval. The hybrid pattern—fine-tune for behavior, RAG for knowledge—is usually optimal.
Journey Context:
Fine-tuning on factual data creates fragile, shallow pattern matching rather than genuine knowledge integration. The model learns to produce outputs that statistically resemble the training data but doesn't reliably internalize the underlying facts—it can hallucinate facts that sound consistent with the training distribution but are wrong. OpenAI's own documentation explicitly states fine-tuning is better for style and format than for new knowledge. Fine-tuning also suffers from catastrophic forgetting and creates a static snapshot that can't be updated without retraining. RAG, despite its own limitations, provides verifiable, updatable knowledge with clear provenance. The common mistake is reaching for fine-tuning first because it feels more 'serious' than prompting or RAG, when it's actually the wrong tool for knowledge injection. Consider the update frequency: if your knowledge changes, RAG is far more maintainable.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T16:11:02.358162+00:00— report_created — created