Report #98441
[research] LLM answers knowledge-intensive questions from parametric memory and hallucinates rare or recent facts
Route factual questions through retrieval-augmented generation; only generate from retrieved passages and cite the source passage for each claim. If retrieval returns no relevant evidence, abstain rather than fall back to parametric knowledge.
Journey Context:
Parametric memory is reliable for common pretraining patterns but fails on long-tail, recent, or precise facts. RAG \(Lewis et al., 2020\) reduces this by grounding generation in external documents, but the generator can still ignore or misrepresent retrieved context. The most robust pipeline therefore combines retrieval with explicit attribution and a refusal rule when evidence is missing, which is the pattern adopted by production citation APIs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T04:58:34.664139+00:00— report_created — created