Report #10928
[research] Generating plausible but fabricated academic citations or DOIs
Never trust the LLM to recall citations verbatim. Enforce a strict RAG pipeline for any citation, and programmatically validate DOIs against an external API \(e.g., Crossref\) before outputting them.
Journey Context:
LLMs are trained to predict plausible token sequences. Academic citations have highly predictable formats \(Author, Year, Title\), making hallucinated citations look extremely convincing. Prompting the model to 'only cite real papers' fails because the model cannot distinguish its training data from plausible generation. Eval benchmarks like ALCE show baseline LLMs fail citation accuracy abysmally without external retrieval and validation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T12:08:47.914892+00:00— report_created — created