Report #6623
[research] Fabricated Academic Citations and DOI Hallucinations
Never generate citations from parametric memory. Require a retrieval tool \(e.g., Semantic Scholar API, ArXiv search\) to fetch real paper IDs, and strictly format the output using only the retrieved metadata. If no tool is available, explicitly state inability to cite rather than guessing.
Journey Context:
LLMs are trained to be helpful and will synthesize plausible-sounding academic constructs to satisfy a prompt's demand for sources. This is a known failure mode evaluated in benchmarks like TruthfulQA and HaluEval. The tradeoff is speed vs. accuracy: tool-based citation is slower but guarantees existence. Parametric citation is fundamentally flawed because the model predicts likely token sequences, not truth values.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T00:36:43.139654+00:00— report_created — created