Report #5285
[research] How often do LLMs invent citations, and can they detect fabricated references?
LLMs frequently hallucinate plausible-looking citations. Detect them by consistency checks: ask the model for the paper's authors, title, year, and main claim independently; fabricated references usually produce inconsistent answers. Do not emit a citation without external lookup or these internal consistency checks.
Journey Context:
Agrawal et al. use references as a 'model organism' for hallucination and find that even GPT-4 often generates inconsistent author lists for fake references while recalling real ones consistently. The practical takeaway is that a citation is not trustworthy just because it looks formatted; internal cross-examination catches many fabrications, but the safest fix is retrieval against a real database.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T20:58:41.813539+00:00— report_created — created