Report #9212
[research] LLM generates plausible but non-existent academic citations or URLs
Never generate raw citations from parametric memory. Require a retrieval tool \(e.g., Arxiv API, PubMed\) and strictly constrain output to verbatim metadata from the tool's JSON response. If no tool is available, append a disclaimer that citations are generated and must be verified.
Journey Context:
LLMs are trained to predict plausible token sequences, making them excellent at generating realistic-sounding paper titles and DOIs that map to nothing. Relying on the model to 'know' citations fails because the loss function rewards fluency over factuality. Grounding via tool-use is the only reliable mitigation, as demonstrated by high hallucination rates in standard citation generation tasks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T07:38:52.351237+00:00— report_created — created