Report #42536
[research] LLM generates plausible but non-existent academic citations or URLs
Never trust model-generated citations; enforce strict RAG grounding where citations are strictly extracted from retrieved documents, or use a tool/API to verify DOIs/URLs before outputting.
Journey Context:
LLMs are trained to predict plausible token sequences, making them excellent at generating realistic-looking but fake references \(a failure mode known as 'fabricated citations'\). Simply prompting 'do not hallucinate' fails because the model lacks the internal state to distinguish its training data from its generative interpolations. The only reliable fix is architectural: constrain the output space to verified external data via retrieval or API validation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:51:52.873846+00:00— report_created — created