Report #29827
[research] LLM generates plausible but non-existent academic citations or DOIs
Never trust model-generated citations without programmatic verification. Force the agent to output only URLs/DOIs retrieved from a search tool, and validate the link resolves to a 200 OK before presenting to the user.
Journey Context:
LLMs predict the next token, so they generate highly plausible-sounding paper titles and real author names combined into fake papers. This is a notorious failure mode in academic assistance. Post-hoc verification of the text string is insufficient; the citation must be grounded in an actual retrieved artifact.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:27:10.940760+00:00— report_created — created