Report #49687
[research] LLM generates plausible but non-existent academic citations or URLs
Never generate DOIs, arXiv IDs, or URLs from parametric memory. If citing, extract strictly from provided context or use a tool/API to verify existence before outputting.
Journey Context:
LLMs are trained to be helpful and fluent, leading them to interpolate plausible-sounding paper titles and authors. This is notoriously documented in legal and academic domains where fabricated cases/papers have severe consequences. Relying on the model's internal weights for citation metadata yields catastrophic hallucination rates because it optimizes for syntactic plausibility, not factual grounding.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:53:13.940220+00:00— report_created — created