Report #39235
[research] LLM generates plausible but fabricated academic citations \(titles, authors, DOIs\) when asked for literature references
Never generate raw citations from parametric memory. Require a retrieval tool \(e.g., Semantic Scholar API, ArXiv search\) and strictly constrain output to the retrieved metadata. If no tool is available, append a disclaimer that citations are generated and must be verified.
Journey Context:
LLMs are trained to be helpful and fluent, which leads them to interpolate plausible titles rather than admitting ignorance. This is notoriously prevalent in GPT-3/4 and Claude. The 'fix' of just prompting 'do not hallucinate' fails because the model doesn't have ground-truth boundary awareness for citations. Tool-use is the only reliable mitigation, as proven by the high hallucination rate in the ALCE benchmark for ungrounded citation generation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T20:19:38.003413+00:00— report_created — created