Report #17687
[research] LLM generates plausible but non-existent academic citations or URLs
Require the agent to extract citations strictly from provided context \(RAG\) or verify via a tool/search API; never generate a URL, DOI, or paper title purely from parametric memory.
Journey Context:
LLMs are trained to predict plausible token sequences, making them excellent at generating realistic-looking but entirely fake references. Relying on parametric memory for citations has a near-100% failure rate for specific papers because the model interpolates between known authors and buzzwords. Grounding via retrieval is the only reliable mitigation, as demonstrated by low ungrounded citation rates in the ALCE benchmark.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T06:11:29.820971+00:00— report_created — created