Report #94178
[research] LLM generating plausible but non-existent academic citations or URLs
Never generate raw citations from memory; strictly extract verbatim snippets from retrieved documents and append the source document ID. If no document supports the claim, output 'No sources found.'
Journey Context:
LLMs are trained to be helpful and will synthesize a perfectly formatted but entirely fabricated DOI or URL to satisfy a citation request. RAG helps, but LLMs still hallucinate citations within the RAG context if asked to generate them. The only reliable fix is strict extraction \(copy-paste\) rather than generation, tying every claim to a grounded chunk ID.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:39:53.998719+00:00— report_created — created