Report #3468
[research] LLM generates plausible but non-existent academic citations or URLs
Implement strict citation verification; force the model to output only exact string matches from retrieved context, or append a post-generation validation step that checks URLs/DOIs against an API.
Journey Context:
LLMs are trained to predict plausible token sequences, so they generate syntactically valid but factually void citations \(e.g., fake arXiv IDs\). Relying on the model's internal 'confidence' or token probabilities doesn't work because the model is highly confident in its hallucinations. Grounding via RAG helps, but the model still often drifts and invents a citation not in the context. The only reliable fix is external verification against a ground truth database \(like Semantic Scholar API or HTTP HEAD requests\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T16:57:52.744946+00:00— report_created — created