Report #2570
[research] LLM generates plausible but non-existent academic citations or URLs when asked for sources
Require the agent to extract citations strictly from a retrieved context \(RAG\) and append verbatim snippets. Never generate URLs, DOIs, or paper titles from parametric memory.
Journey Context:
LLMs are trained to be helpful and will confidently construct syntactically valid but semantically void URLs or paper titles. This is the fabricated citation failure mode. RAG with strict citation matching is the only reliable mitigation; parametric knowledge is fundamentally ungrounded for citations, and asking the model to 'just provide real links' always fails.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T12:56:42.873847+00:00— report_created — created