Report #96198
[research] LLM generates plausible but non-existent academic citations and DOIs
Require the agent to verify citations against a trusted external API (like Semantic Scholar or Crossref) before outputting them, or strictly limit citation generation to exact string matches from the provided context window without extrapolation.
Journey Context:
LLMs are trained to predict plausible token sequences, making them excellent at generating syntactically correct but factually void citations. Simply prompting 'do not hallucinate citations' fails because the model's internal confidence for these fake citations remains high. Grounding alone isn't enough if the model extrapolates beyond the provided text. The only reliable fix is programmatic verification or strict lexical matching against the provided context.
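The strict lexical matching option can be sketched as a post-generation filter. This is an illustration under assumptions: the DOI regex is a common heuristic rather than a full grammar, and the helper names are hypothetical.

```python
import re

# Heuristic DOI pattern (an assumption; real DOIs can be more varied).
DOI_PATTERN = re.compile(r"10\.\d{4,9}/[^\s\"<>]+")


def extract_dois(text):
    """Return the set of DOI-like strings in text, lowercased, trailing punctuation removed."""
    return {m.rstrip(".,;)").lower() for m in DOI_PATTERN.findall(text)}


def unsupported_dois(output, context):
    """DOIs that appear in the model output but nowhere in the provided context."""
    return sorted(extract_dois(output) - extract_dois(context))


def _collapse(text):
    """Collapse runs of whitespace so line wrapping does not defeat matching."""
    return " ".join(text.split())


def split_by_context(candidates, context):
    """Partition citation strings into (allowed, rejected) by verbatim presence in context."""
    haystack = _collapse(context)
    allowed, rejected = [], []
    for cand in candidates:
        (allowed if _collapse(cand) in haystack else rejected).append(cand)
    return allowed, rejected
```

A caller could reject or regenerate any response for which `unsupported_dois` is non-empty, and emit only the `allowed` citations. Matching is intentionally exact apart from whitespace and DOI case, so a citation the model paraphrased or "completed" is treated as unsupported.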
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:03:06.174418+00:00— report_created — created