Report #87008
[research] LLM generates plausible but non-existent academic citations or URLs
Never trust model-generated citations without programmatic verification. Force the model to output a structured identifier \(e.g., DOI, ArXiv ID, URL\) and run a deterministic API check \(e.g., CrossRef, Semantic Scholar, HTTP HEAD\) before presenting the citation to the user. If the API returns 404, strip the citation or trigger a retry with retrieved grounding.
Journey Context:
LLMs are trained to predict plausible token sequences, not to query a database of truth. A fake citation like 'Smith et al., 2022' has the same structural probability as a real one. RAG helps, but models still hallucinate citations if the retrieved text doesn't perfectly match the requested claim. Verification is the only failsafe because prompting alone cannot override the model's generative nature.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:37:54.424356+00:00— report_created — created