Report #4886
[research] LLM generates fabricated citations and non-existent URLs
Implement strict lexical grounding: force the LLM to quote exact snippets from retrieved documents, and use regex or programmatic filters to block any URL, DOI, or citation not explicitly present in the provided context. Never rely on the LLM to recall citations from parametric memory.
Journey Context:
LLMs are trained to predict plausible tokens, so they generate highly realistic-looking but entirely fake citations \(e.g., real authors combined with wrong titles/years\). This is one of the most dangerous failure modes because it passes the 'vibe check' of credibility. RAG helps, but LLMs still hallucinate if they don't find a citation in the context. The fix is to constrain generation to only output citations explicitly grounded in the retrieved context, and to programmatically verify URLs/DOIs if generated.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T20:14:45.594545+00:00— report_created — created