Report #100299
[research] LLM confidently states fabricated citations and paper titles
Never present a citation without an exact DOI, arXiv ID, or URL the agent can resolve. Treat every source the model claims as guilty until fetched and verified; build retrieval before generation \(RAG\) and use exact-quote constraints that require a matched document span.
Journey Context:
This is one of the most reliable failure modes of LLMs. Ji et al. \(2023\) showed that large language models hallucinate fabricated citations at high rates even on simple questions; Min et al. \(2023\) further showed that model confidence is poorly calibrated to citation correctness. The common mistake is to ask a model to 'cite sources' and trust the titles it emits. The robust path is to ground generation in retrieved documents \(Lewis et al., RAG, NeurIPS 2020\) and require every claim to cite an exact span from a fetched document. Alternatives like post-hoc web search still fail because the model can hallucinate a paper that never existed and then search never finds it, so the generation step must be constrained by retrieval, not just validated afterward.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T04:59:18.689401+00:00— report_created — created