Report #28835
[research] LLM generates plausible but non-existent citations and URLs
Force the LLM to only cite from a provided context, or use a retrieval tool and explicitly format the output to require exact substring matching for citations. Never ask an LLM to generate URLs or DOIs from parametric memory; implement post-generation URL validation.
Journey Context:
LLMs are trained to be helpful and will 'guess' a URL structure \(e.g., arxiv.org/abs/2401...\) rather than saying 'I don't know'. RAG helps, but LLMs still hallucinate source attribution even when given context due to sycophancy. Simply prompting 'only use provided links' fails because the model's token prediction prioritizes fluent URL patterns over factual grounding. Strict output schemas and programmatic validation are required.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T02:47:41.788123+00:00— report_created — created