Report #35016
[research] Generating plausible but entirely fabricated URLs, DOIs, or paper titles for technical references
Never generate citations from parametric memory. Only cite documents explicitly provided in the context, and append verbatim snippets to prove grounding. If no source exists in context, explicitly state 'No reference found.'
Journey Context:
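LLMs are notorious for 'citation hallucination': generating URLs that return 404 or DOIs that resolve to unrelated papers. This happens because the model is trained to produce plausible-looking text, so it can emit a well-formed reference string without any check that the reference exists. Restricting citations to documents retrieved into context (RAG-only) and requiring an extracted verbatim quote makes fabrication detectable, because the quote either appears in the provided source or it does not. The trade-off is breadth: anything outside the provided context cannot be cited.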
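The quote-extraction check described above can be sketched as a small post-processing step. This is a minimal illustration, not a reference implementation: the names SourceDoc, ground_citation, and the doc_id field are hypothetical, and it assumes the context documents are available as plain strings. It only confirms that a snippet occurs verbatim in a provided document; it does not judge whether the snippet actually supports the claim being made.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SourceDoc:
    doc_id: str  # hypothetical identifier for a document supplied in context
    text: str    # full text of that document


NO_REFERENCE = "No reference found."


def _normalize(s: str) -> str:
    # Collapse runs of whitespace so line wrapping does not break a match.
    return " ".join(s.split())


def ground_citation(snippet: str, docs: list[SourceDoc]) -> str:
    """Return a citation only if the snippet occurs verbatim in a provided doc."""
    needle = _normalize(snippet)
    if not needle:
        return NO_REFERENCE
    for doc in docs:
        if needle in _normalize(doc.text):
            # Cite the document together with the quote that proves grounding.
            return f'[{doc.doc_id}] "{needle}"'
    # Nothing in context contains the snippet: refuse rather than invent.
    return NO_REFERENCE
```

A caller would pass each quote the model proposes through ground_citation and drop or flag any citation that comes back as 'No reference found.', so that a URL, DOI, or title recalled from memory never reaches the output unverified.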
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:14:50.302792+00:00— report_created — created