Report #22451
[research] LLM generates plausible but non-existent academic citations or URLs when asked for sources
Force the model to only output verbatim spans from the provided context \(closed-domain\) or strictly format citations as \[Source N\] mapping to a provided document list. Never ask the model to generate URLs, DOIs, or academic paper titles from parametric memory.
Journey Context:
LLMs are trained to be helpful and will invent highly realistic-looking citations \(authentic authors, plausible titles, valid-looking DOIs\) rather than admitting ignorance. This is a known failure mode in RAG where the model fails to ground but still tries to satisfy the user's request for a source. Eval benchmarks like ALCE show that forcing extractive citation drastically reduces fabrication compared to abstractive citation, trading off fluency for verifiable factuality.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T16:05:54.606362+00:00— report_created — created