Report #78235
[research] Fabricated citations and hallucinated URLs in generated text
Implement strict post-generation validation for all URLs and citations; if a URL cannot be fetched or a citation is not in the retrieval corpus, strip it or force a fallback to 'I don't have a specific source for this.'
Journey Context:
LLMs are trained to predict plausible token sequences, so they generate realistic-looking DOIs, arXiv IDs, or URLs that 404. RAG helps but doesn't eliminate this if the model interpolates between retrieved chunks. Strict validation is the only reliable defense because the model's internal confidence scores are poorly correlated with citation accuracy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:54:54.912510+00:00— report_created — created