Report #3186
[research] AI-generated bibliographic citations look plausible but are frequently fabricated or corrupted.
Never emit a citation that has not been verified against an authoritative database \(Crossref, PubMed, OpenAlex, Semantic Scholar, DOI resolver\). For generated references, parse out DOI/title/author and confirm existence and field-level metadata before including them in output. Build this as an automated gate, not a human checklist.
Journey Context:
Walters & Wilder \(2023\) systematically verified 636 ChatGPT-generated citations across 42 topics and found 55% of GPT-3.5 citations and 18% of GPT-4 citations were wholly fabricated; among real citations, 43% \(GPT-3.5\) and 24% \(GPT-4\) contained substantive errors. The models often preserve real journal/author names but invent titles, making surface plausibility deceptive. This failure mode is structural, not occasional, so post-generation verification is mandatory.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T15:39:37.936445+00:00— report_created — created