Report #4554
[research] LLM generates plausible but non-existent academic citations, DOIs, or URLs when asked for sources
Mandate strict citation grounding: only cite if the exact string (title/author) exists in the provided context or a verified tool output; never generate URLs/DOIs from parametric memory.
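A minimal sketch of the inclusion check, assuming citations arrive as dicts with `title` and `author` keys and the provided context is a single string. The names `is_grounded` and `filter_citations` are hypothetical, and the case/punctuation normalization is an assumption that relaxes "exact string" slightly so that formatting differences alone do not block a match; drop it for a literal comparison.

```python
import re


def _normalize(text: str) -> str:
    """Lowercase and collapse every non-alphanumeric run into one space."""
    return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()


def is_grounded(title: str, author: str, context: str) -> bool:
    """Return True only if both title and author occur in the context.

    Padding with spaces forces matches on whole-word boundaries, so an
    author "Li" does not match inside an unrelated word such as "limit".
    """
    norm_context = f" {_normalize(context)} "
    norm_title = _normalize(title)
    norm_author = _normalize(author)
    if not norm_title or not norm_author:
        # An empty field can never count as grounded.
        return False
    return (
        f" {norm_title} " in norm_context
        and f" {norm_author} " in norm_context
    )


def filter_citations(citations: list[dict], context: str) -> list[dict]:
    """Keep only the citations whose title and author appear in the context."""
    return [
        c for c in citations
        if is_grounded(c.get("title", ""), c.get("author", ""), context)
    ]
```

Anything the filter drops should be reported as "no source available" rather than regenerated, since regeneration falls back to parametric memory.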
Journey Context:
LLMs are trained to be helpful and will synthesize a 'perfect' looking citation to satisfy a user's request for a source. The structural validity of a DOI or URL is easy to mimic, so a well-formed identifier proves nothing. Verifying via web search is insufficient, because the LLM might generate a valid URL that points to an unrelated paper. The only safe patterns are strict inclusion grounding (cite only from context) or programmatic API verification (e.g., the CrossRef API).
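The API verification route could be sketched as follows. It assumes the public CrossRef REST endpoint `https://api.crossref.org/works/{doi}` and that the registered titles sit in the `title` list of the response's `message` object; check both against the current CrossRef documentation before relying on them. The function names and the injectable `fetch` parameter are hypothetical. The key point is that the DOI must resolve and its registered title must match the claimed one, which rejects a valid DOI that belongs to an unrelated paper.

```python
import json
import re
import urllib.error
import urllib.parse
import urllib.request

# Assumed endpoint of the public CrossRef REST API.
CROSSREF_WORKS = "https://api.crossref.org/works/"


def fetch_crossref_record(doi: str, timeout: float = 10.0):
    """Return the CrossRef metadata dict for a DOI, or None if unregistered."""
    url = CROSSREF_WORKS + urllib.parse.quote(doi, safe="/")
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return json.load(resp)["message"]
    except urllib.error.HTTPError as exc:
        if exc.code == 404:
            return None  # DOI is not known to CrossRef
        raise


def _normalize(text: str) -> str:
    """Lowercase and collapse every non-alphanumeric run into one space."""
    return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()


def doi_matches_title(doi: str, claimed_title: str,
                      fetch=fetch_crossref_record) -> bool:
    """True only if the DOI resolves AND its registered title equals the claim."""
    record = fetch(doi)
    if record is None:
        return False
    registered_titles = record.get("title") or []
    claimed = _normalize(claimed_title)
    return bool(claimed) and any(
        _normalize(t) == claimed for t in registered_titles
    )
```

Passing `fetch` as a parameter lets the check be exercised offline with a stub. A network failure other than a 404 is raised rather than treated as a pass, so an unverifiable citation is never silently accepted.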
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T19:41:38.353012+00:00— report_created — created