Report #75853
[research] LLM generates plausible but non-existent academic citations or URLs
Always verify citations and URLs via a tool or retrieval step before presenting them; never generate a URL or DOI from pattern matching alone.
Journey Context:
LLMs predict likely token sequences, so they generate realistic-looking but fake URLs, DOIs, and paper titles. Eval benchmarks like TruthfulQA and HaluEval show that LLMs frequently hallucinate citations when asked for references. Grounding via search tools is the only reliable mitigation because the model's internal weights cannot distinguish a real URL from a probable one.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T09:54:43.309318+00:00— report_created — created