Report #7580
[research] LLM generates fabricated citation, DOI, or paper reference in documentation or comments
Never generate academic citations from memory. If a citation is required, use a tool to search a scholarly database (e.g., Semantic Scholar API) and retrieve the exact DOI and metadata.
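A minimal sketch of that lookup, assuming the public Semantic Scholar Graph API paper search endpoint and only the Python standard library. The helper names (build_search_url, extract_citations, lookup) are hypothetical and chosen for illustration. Records without a DOI are dropped rather than completed from memory.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the public Graph API search endpoint; an API key or retry
# logic may be needed in practice because of rate limits.
SEARCH_URL = "https://api.semanticscholar.org/graph/v1/paper/search"
FIELDS = "title,authors,year,externalIds"


def build_search_url(title, limit=3):
    """Build a paper-search URL for a title query."""
    query = urllib.parse.urlencode(
        {"query": title, "fields": FIELDS, "limit": limit}
    )
    return f"{SEARCH_URL}?{query}"


def extract_citations(payload):
    """Keep only records that carry a DOI; never fill in a missing one."""
    citations = []
    for paper in payload.get("data") or []:
        doi = (paper.get("externalIds") or {}).get("DOI")
        if not doi:
            continue  # no DOI retrieved, so nothing to cite from this record
        citations.append({
            "title": paper.get("title"),
            "authors": [a.get("name") for a in paper.get("authors") or []],
            "year": paper.get("year"),
            "doi": doi,
        })
    return citations


def lookup(title, timeout=10):
    """Query the API and return DOI-bearing candidates (needs network access)."""
    with urllib.request.urlopen(build_search_url(title), timeout=timeout) as resp:
        return extract_citations(json.load(resp))
```

The candidates returned by lookup still need a human or a second check to confirm that the matched paper is the one actually being cited. An empty result means no citation should be written.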
Journey Context:
LLMs are trained to output plausible citation formats (authors, year, title) but the specific combination is often a hallucinated blend of real papers. This is the classic fabricated-citation failure mode where the model acts as a stochastic mixer of academic tokens rather than a database.
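Because a blended citation can get some fields right and others wrong, one way to catch it is a field-by-field comparison against a record retrieved from a database. The sketch below is illustrative only: mismatched_fields and the dictionary keys (title, authors, year, doi) are hypothetical, and treating the last word of a name as the surname is a crude assumption that fails for many naming conventions.

```python
import re


def _norm(text):
    """Lowercase and collapse punctuation so cosmetic differences are ignored."""
    return re.sub(r"[^a-z0-9]+", " ", (text or "").lower()).strip()


def _surnames(names):
    """Crude assumption: the last word of each name is the surname."""
    return {_norm(n).split(" ")[-1] for n in names or [] if _norm(n)}


def mismatched_fields(claimed, record):
    """Return the fields where a model-written citation disagrees with
    a record retrieved from a scholarly database."""
    problems = []
    if _norm(claimed.get("title")) != _norm(record.get("title")):
        problems.append("title")
    if claimed.get("year") != record.get("year"):
        problems.append("year")
    claimed_doi = (claimed.get("doi") or "").strip().lower()
    record_doi = (record.get("doi") or "").strip().lower()
    if claimed_doi != record_doi:
        problems.append("doi")
    claimed_names = _surnames(claimed.get("authors"))
    # Every claimed author must appear in the retrieved author list.
    if not claimed_names or not claimed_names <= _surnames(record.get("authors")):
        problems.append("authors")
    return problems
```

Any non-empty result means the citation should be replaced with the retrieved metadata or dropped, not patched by hand.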
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T03:12:53.328800+00:00— report_created — created