Report #12743
[research] LLM generates plausible but non-existent academic references, DOIs, or URLs when asked for literature citations
Never generate citations from parametric memory alone. Require a retrieval tool (e.g., the arXiv API or Semantic Scholar) and ground every citation strictly in the retrieved metadata; refuse to cite if the tools return no results.
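The rule above can be sketched as a small wrapper that formats citations only from retrieved records and refuses when retrieval comes back empty. This is a minimal illustration, not a definitive implementation: the `Record` shape, the `Retriever` signature, and the function names are all hypothetical, and the actual retriever (an arXiv or Semantic Scholar client, for instance) is assumed to be supplied by the caller.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass(frozen=True)
class Record:
    """Metadata as returned by a retrieval tool (hypothetical shape)."""
    title: str
    authors: List[str]
    year: int
    doi: Optional[str] = None
    url: Optional[str] = None


# Hypothetical retriever signature: query string in, list of records out.
Retriever = Callable[[str], List[Record]]


def format_citation(rec: Record) -> str:
    """Build a citation string using only fields present in the record."""
    authors = ", ".join(rec.authors)
    locator = f" doi:{rec.doi}" if rec.doi else (f" {rec.url}" if rec.url else "")
    return f"{authors} ({rec.year}). {rec.title}.{locator}"


def cite(query: str, retrieve: Retriever, limit: int = 3) -> List[str]:
    """Return citations grounded in retrieved records, or an empty list as a refusal."""
    records = retrieve(query)
    if not records:
        return []  # refuse: nothing was retrieved, so nothing may be cited
    return [format_citation(r) for r in records[:limit]]
```

The key design choice is that no citation field is ever written by the model: titles, authors, DOIs, and URLs are copied from the retrieved record or omitted.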
Journey Context:
LLMs optimize for fluency, so they can produce highly realistic but entirely fabricated paper titles, author lists, DOIs, and URLs. This is a known failure mode of text generation, commonly called hallucination. Grounding via tool use is the only reliable mitigation because model weights blend concepts rather than retrieve exact records.
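One way to enforce grounding after generation is to scan a draft answer for DOI-like strings and flag any that do not appear in the retrieved metadata. The sketch below assumes a loose regular expression for DOIs (an approximation, not an official grammar) and hypothetical function names; it checks only that a DOI was retrieved, not that it is attached to the right paper.

```python
import re
from typing import Iterable, List

# Loose pattern for DOI-like strings; an assumption, not an official grammar.
DOI_PATTERN = re.compile(r"10\.\d{4,9}/[^\s\"<>]+")


def normalize(doi: str) -> str:
    """Lowercase a DOI and drop trailing punctuation picked up from prose."""
    return doi.strip().rstrip(".,;)").lower()


def ungrounded_dois(draft: str, retrieved_dois: Iterable[str]) -> List[str]:
    """Return DOIs found in the draft that are absent from the retrieved set."""
    allowed = {normalize(d) for d in retrieved_dois}
    found = [normalize(m) for m in DOI_PATTERN.findall(draft)]
    return [d for d in found if d not in allowed]
```

A non-empty result means the draft contains at least one identifier that did not come from a tool call, so the answer should be rejected or regenerated rather than shown.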
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T16:49:05.024031+00:00 - report_created - created