Report #50448
[research] LLM generates plausible but completely fabricated academic citations or DOIs
Never trust LLM-generated citations without programmatic verification against a bibliographic database (e.g., Semantic Scholar API, Crossref); instruct the model to cite only the context chunks it was explicitly given.
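A minimal sketch of the verification step, assuming the public Crossref REST route `https://api.crossref.org/works/{doi}`, which returns HTTP 404 for DOIs Crossref does not know. The function names, the three status strings, and the 0.85 title-similarity threshold are illustrative choices, not part of the original report.

```python
import json
import urllib.error
import urllib.parse
import urllib.request
from difflib import SequenceMatcher

CROSSREF_WORKS = "https://api.crossref.org/works/"  # public Crossref REST endpoint


def fetch_crossref(doi, timeout=10.0):
    """Return the Crossref record for a DOI, or None if Crossref answers 404."""
    url = CROSSREF_WORKS + urllib.parse.quote(doi, safe="/")
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return json.load(resp)["message"]
    except urllib.error.HTTPError as err:
        if err.code == 404:
            return None
        raise  # other HTTP errors are not evidence either way


def check_citation(doi, claimed_title, fetch=fetch_crossref, min_similarity=0.85):
    """Classify a model-produced citation (hypothetical helper).

    Returns 'verified', 'title_mismatch', or 'doi_not_found'.
    The threshold is an assumed value; tune it on real data.
    """
    record = fetch(doi)
    if record is None:
        return "doi_not_found"
    titles = record.get("title") or []  # Crossref stores titles as a list
    best = max(
        (SequenceMatcher(None, claimed_title.lower(), t.lower()).ratio() for t in titles),
        default=0.0,
    )
    return "verified" if best >= min_similarity else "title_mismatch"
```

A DOI that resolves is not enough on its own: fabricated references sometimes reuse a real DOI with the wrong title, which is why the sketch also compares titles. Network failures are raised instead of being mapped to a status, so an outage is never mistaken for a verification result.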
Journey Context:
LLMs excel at mimicking the structure of academic references (author names, title formats, plausible DOIs) but fail at factual recall of specific papers. Asking the LLM to 'be accurate' does not eliminate fabrication; the only reliable fix is hard external grounding or strict RAG boundaries where the model is forbidden from relying on parametric memory for citations.
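One way to enforce a strict RAG boundary is to give each context chunk an ID, tell the model to cite by ID only, and reject any answer that cites an ID that was never supplied. The sketch below assumes a hypothetical `[C<number>]` marker format and hypothetical helper names; the prompt wording is illustrative.

```python
import re

CITE_PATTERN = re.compile(r"\[(C\d+)\]")  # assumed marker format, e.g. [C3]


def build_prompt(question, chunks):
    """Build a prompt from chunks, a dict mapping chunk ID -> chunk text."""
    context = "\n".join(f"[{cid}] {text}" for cid, text in chunks.items())
    return (
        "Answer using only the sources below. Cite a source by its bracketed ID. "
        "Do not cite anything that is not listed; if the sources do not cover "
        "the question, say so.\n\n"
        f"Sources:\n{context}\n\nQuestion: {question}"
    )


def unknown_citations(answer, chunks):
    """Return the sorted cited IDs that were not among the provided chunks."""
    cited = set(CITE_PATTERN.findall(answer))
    return sorted(cited - set(chunks))
```

A non-empty result from `unknown_citations` means the answer cited something outside the supplied context and should be rejected or regenerated. This check only catches out-of-set IDs; free-text references such as an author-year string would need a separate filter.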
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:09:36.242414+00:00— report_created — created