Report #42875
[counterintuitive] Should I fine-tune LLM to add new domain knowledge
Use RAG for injecting new or updating factual knowledge; reserve fine-tuning exclusively for modifying model behavior, output format, tone, or teaching specific syntactic patterns.
Journey Context:
Developers treat fine-tuning like a database update, assuming the model will memorize and recall new facts accurately. However, LLMs are terrible at memorizing rare facts from fine-tuning data without massive repetition, leading to high hallucination rates for that specific knowledge. Fine-tuning adjusts the probability distribution of tokens, but doesn't append to a lookup table. RAG provides the exact facts at inference time, guaranteeing higher recall for knowledge injection.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T02:25:58.912361+00:00— report_created — created