Report #2727
[research] LLM generates plausible-looking but fake citations
Never emit a citation without verifying it against a bibliographic database (Crossref, OpenAlex, Semantic Scholar) or the original source; constrain generation to retrieved papers only.
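A minimal sketch of the verification step, assuming each generated citation arrives as a dict with `doi` and `title` keys (that shape, the helper names, and the 0.9 similarity threshold are illustrative assumptions, not part of this report). It looks the DOI up through the public Crossref REST endpoint `https://api.crossref.org/works/{doi}` and rejects the citation if the DOI is unknown or the recorded title does not match the claimed one. The `fetch` parameter is injectable so the check can be exercised without network access.

```python
import json
import urllib.error
import urllib.parse
import urllib.request
from difflib import SequenceMatcher

CROSSREF_WORKS = "https://api.crossref.org/works/"


def fetch_crossref(doi):
    """Return the Crossref metadata record for a DOI, or None if Crossref has no such DOI."""
    url = CROSSREF_WORKS + urllib.parse.quote(doi, safe="/")
    try:
        with urllib.request.urlopen(url, timeout=10) as resp:
            return json.load(resp)["message"]
    except urllib.error.HTTPError as err:
        if err.code == 404:  # unknown DOI: a strong sign the citation is fabricated
            return None
        raise


def normalize(text):
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    cleaned = "".join(c.lower() if c.isalnum() else " " for c in text)
    return " ".join(cleaned.split())


def title_matches(claimed, recorded, threshold=0.9):
    """Fuzzy title comparison; the threshold is an assumed value, tune it for your data."""
    ratio = SequenceMatcher(None, normalize(claimed), normalize(recorded)).ratio()
    return ratio >= threshold


def verify_citation(citation, fetch=fetch_crossref):
    """Return (ok, reason). A citation passes only if its DOI resolves and the title agrees."""
    doi = citation.get("doi")
    if not doi:
        return False, "no DOI to check"
    record = fetch(doi)
    if record is None:
        return False, "DOI not found"
    recorded_titles = record.get("title") or []
    if not any(title_matches(citation.get("title", ""), t) for t in recorded_titles):
        return False, "title mismatch"
    return True, "verified"
```

A well-formed DOI alone is not enough, since a fabricated citation can borrow a real DOI from an unrelated paper; that is why the title is compared as well. Citations without a DOI fail closed here and would need a title search against one of the databases instead.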
Journey Context:
ChatGPT has been shown to fabricate references that appear structurally valid (Alkaissi & McFarlane 2023), and recent audits found hundreds of hallucinated citations in peer-reviewed venues. Language models are not citation databases; they predict plausible author/title/venue/DOI patterns instead of looking records up. Common mistake: trusting a citation because it is formatted correctly or mentions real-sounding authors. WebGPT's safer pattern is to require every claim to be supported by references collected during browsing and to verify those references before returning them.
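The "retrieved references only" constraint can be enforced after generation with a simple allow-list check. The sketch below assumes the model marks citations as numeric tags like `[1]` and that the retrieval step produced a dict mapping those numbers to source records; both conventions and the function names are hypothetical, chosen only for illustration.

```python
import re

CITE_TAG = re.compile(r"\[(\d+)\]")  # assumed marker style: [1], [2], ...


def check_citations(draft, retrieved):
    """Split the citation ids used in a draft into (supported, unsupported) lists."""
    cited = {int(n) for n in CITE_TAG.findall(draft)}
    known = set(retrieved)
    return sorted(cited & known), sorted(cited - known)


def enforce_retrieved_only(draft, retrieved):
    """Return the cited source records, or raise if the draft cites anything not retrieved."""
    supported, unsupported = check_citations(draft, retrieved)
    if unsupported:
        raise ValueError(f"citations not in retrieved set: {unsupported}")
    return [retrieved[i] for i in supported]


# Usage: ids 1 and 2 came from retrieval; the draft also cites an unknown id 3.
retrieved = {1: "Source A", 2: "Source B"}
supported, unsupported = check_citations("Claim one [1]. Claim two [3].", retrieved)
print(supported, unsupported)
```

Raising on any unsupported id, instead of silently dropping it, keeps a fabricated reference from reaching the reader and signals that the claim attached to it also needs review.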
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T13:39:51.427768+00:00— report_created — created