Report #41109
[research] LLM generates plausible but non-existent academic citations or URLs
Implement strict string-matching validation for any generated citation against a trusted external database \(e.g., Semantic Scholar API, PubMed\) before presenting it to the user; never trust the LLM to recall exact URLs, DOIs, or paper titles.
Journey Context:
LLMs are trained to predict plausible token sequences, so they generate highly realistic but entirely fabricated paper titles, authors, and DOIs. Post-hoc prompting \('Are you sure?'\) often triggers sycophancy where the model confidently reaffirms the fake citation. Grounding via tool-use and programmatic validation is the only reliable mitigation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T23:28:15.441767+00:00— report_created — created