Report #35251
[research] LLM generates plausible but non-existent academic citations or URLs when asked for sources
Require the agent to extract citations strictly from the provided context (RAG), or to verify each URL or DOI with a tool such as a search API before outputting it. Never trust the model to recall a URL verbatim.
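A minimal sketch of that rule, not a definitive implementation: a citation is kept only if it appears verbatim in the retrieved context or passes a verification callback. The names `extract_references`, `grounded_citations`, and `verify` are hypothetical, the regexes are deliberately loose, and the default `verify` rejects everything so that no network access is assumed. In practice `verify` would wrap whatever search or resolver API the agent already has.

```python
import re
from typing import Callable

# Loose, illustrative patterns; real URLs and DOIs have more edge cases.
URL_RE = re.compile(r"https?://[^\s<>)\]]+")
DOI_RE = re.compile(r"\b10\.\d{4,9}/[^\s<>]+")


def extract_references(text: str) -> set[str]:
    """Collect URL- and DOI-shaped strings, trimming trailing punctuation."""
    found = URL_RE.findall(text) + DOI_RE.findall(text)
    return {ref.rstrip(".,;:") for ref in found}


def grounded_citations(
    model_output: str,
    context: str,
    verify: Callable[[str], bool] = lambda ref: False,  # hypothetical hook
) -> list[str]:
    """Keep only references found in the context or confirmed by `verify`."""
    allowed = extract_references(context)
    kept = []
    for ref in sorted(extract_references(model_output)):
        if ref in allowed or verify(ref):
            kept.append(ref)
    return kept


# Placeholder URLs on example.org, used purely for illustration.
context = "Retrieved passage, source: https://example.org/papers/a"
output = "Sources: https://example.org/papers/a and https://example.org/papers/b."
print(grounded_citations(output, context))
```

Here the second URL is dropped because it is neither in the context nor confirmed by a tool. Defaulting `verify` to rejection is a deliberate choice in this sketch: an unverifiable citation is treated as absent rather than trusted.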
Journey Context:
LLMs are trained to be helpful and fluent, so they generate URLs that follow valid structural patterns (e.g., arxiv.org/abs/XXXX.XXXXX) but point to unrelated or non-existent papers. Structural validity does not imply factual existence. Tool-based grounding is the only reliable mitigation because the model cannot reliably distinguish, from its weights alone, between highly probable text and factually grounded text.
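The gap between structural validity and existence can be illustrated with a short sketch. It assumes the new-style arXiv identifier shape (four digits, a dot, then four or five digits, with an optional version suffix). The `exists` callback and the `known_ids` set are hypothetical stand-ins for a real lookup tool and do not describe actual papers.

```python
import re
from typing import Callable

# Shape check only: YYMM.NNNN or YYMM.NNNNN, optionally followed by vN.
ARXIV_ID_RE = re.compile(r"^\d{4}\.\d{4,5}(v\d+)?$")


def looks_like_arxiv_id(candidate: str) -> bool:
    """True if the string has the shape of an arXiv ID; says nothing about existence."""
    return ARXIV_ID_RE.match(candidate) is not None


def citation_status(candidate: str, exists: Callable[[str], bool]) -> str:
    """Classify a candidate ID using a shape check plus an external lookup."""
    if not looks_like_arxiv_id(candidate):
        return "malformed"
    return "verified" if exists(candidate) else "well-formed but unverified"


# Hypothetical stand-in for a search or resolver tool.
known_ids = {"2401.00001"}

for candidate in ["2401.00001", "2401.99999", "not-an-id"]:
    print(candidate, "->", citation_status(candidate, known_ids.__contains__))
```

The second candidate passes the pattern check yet fails the lookup, which is the failure mode this report describes: only the external check separates it from the first.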
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:38:51.027642+00:00— report_created — created