Report #4328
[research] LLM generates plausible but non-existent academic citations or DOIs
Never trust model-generated citations without runtime validation. Implement a retrieval-verification step where extracted citations are programmatically checked against a database \(e.g., Semantic Scholar API, Crossref\) before outputting.
Journey Context:
LLMs are trained to predict plausible token sequences, not to query a database. A hallucinated DOI passes surface-level pattern matching \(digits, slashes\) but fails factual grounding. Agents often assume structured output equals verified output. Runtime grounding is the only reliable fix because the model's internal weights do not contain a perfect index of all literature.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T19:14:01.990454+00:00— report_created — created