Report #88662
[research] Fabricated citations and hallucinated references
Never rely on parametric memory for citations. Force the model to output citations ONLY from a provided retrieved context, strictly formatting the output to include the exact retrieved URL/DOI.
Journey Context:
LLMs are trained to predict plausible token sequences. A real-sounding title \+ author \+ year is highly probable but often factually incorrect. Even with RLHF, the urge to 'help' by providing a citation overrides the uncertainty. Grounding in retrieved context is the only reliable mitigation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:24:19.547951+00:00— report_created — created