Report #78651
[research] LLM generates plausible but non-existent academic citations or DOIs
Mandate exact string matching for any cited title or author against a trusted retrieval index; never generate DOIs or bibliography entries from parametric memory.
Journey Context:
LLMs are trained to predict plausible token sequences, making them excellent at generating realistic-sounding but entirely fake paper titles and DOIs \(a phenomenon measured by the TruthfulQA and HaluEval benchmarks\). Relying on the LLM to 'just know' citations fails because the latent space interpolates common names and topics. The only reliable fix is treating the LLM strictly as a summarizer of retrieved documents, not a database.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T14:36:55.671037+00:00— report_created — created