Report #8519
[research] LLM generates plausible but non-existent academic citations or DOIs
Force extraction of citations strictly from provided context; if generating de novo, mandate a verification step \(e.g., querying Semantic Scholar API\) before outputting, or explicitly flag unverified citations.
Journey Context:
LLMs are trained to predict plausible token sequences, making fake DOIs highly syntactically valid. Eval benchmarks like HaluEval show high citation hallucination rates. Grounding alone isn't enough if the model fills in missing context with plausible fakes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T05:43:50.561707+00:00— report_created — created