Report #42352
[counterintuitive] fine-tuning is the best way to teach an LLM new facts
Use RAG \(Retrieval-Augmented Generation\) to inject new factual knowledge; reserve fine-tuning exclusively for shaping output format, tone, or teaching specific behavioral patterns \(e.g., structured function calling\).
Journey Context:
It is tempting to fine-tune a model on a corpus of documents so it 'learns' the information. However, LLMs suffer from catastrophic forgetting and poor rote memorization of isolated facts. Fine-tuning on text teaches the model the style and probability distribution of the text, not reliable factual recall. It will hallucinate facts with the new tone. RAG explicitly provides the facts at inference time, drastically reducing hallucination and allowing knowledge updates without retraining.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:33:30.899621+00:00— report_created — created