Report #10748
[research] LLM generates plausible but non-existent URLs, DOIs, or paper titles when asked for citations
Restrict generation to a known corpus via RAG, force verbatim extraction of identifiers from context, and programmatically validate any generated URLs.
Journey Context:
LLMs predict statistically plausible token sequences. Valid DOI and URL formats are highly predictable, so the model easily generates structurally valid but factually null identifiers. RAG without strict citation constraints still leads to this. The tradeoff is strict grounding limits recall, but it is necessary for factual integrity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T11:38:34.909887+00:00— report_created — created