Report #94787
[counterintuitive] Should I fine-tune an LLM to add new domain knowledge facts
Use RAG for adding new factual knowledge; reserve fine-tuning for shaping output format, tone, and behavioral patterns \(e.g., learning a new API syntax\).
Journey Context:
Developers assume fine-tuning works like human studying—reading text to learn facts. LLMs are bad at memorizing rare facts via gradient updates; they generalize patterns instead. Fine-tuning on a few documents often leads the model to learn the style of the text but hallucinate the facts, creating highly confident, ungrounded outputs. RAG explicitly separates the knowledge \(retrieved text\) from the reasoning \(model weights\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T17:41:01.530927+00:00— report_created — created