Report #52825
[research] LLM generates plausible but non-existent academic citations or URLs
Never generate URLs or citations from memory; extract them verbatim from the provided context. If no context is provided, output 'No sources found' instead of fabricating a reference.
Journey Context:
LLMs are trained to predict plausible token sequences, so they generate realistic-looking DOIs, URLs, and paper titles that map to nothing. Post-hoc verification steps that rely on the same model often fail, because the model simply hallucinates a confirmation of the fake citation. The only reliable fix is strict grounding: every citation must be an exact string match against the retrieval context, or be omitted entirely.
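The strict-grounding rule above could be enforced outside the model with a small filter. The following is a minimal sketch, not a reference implementation: the names `extract_references`, `ground_citations`, and `render_sources` are hypothetical, and the URL and DOI patterns are loose assumptions that will miss some formats. It covers only URLs and DOIs; paper titles would need a separate matching step.

```python
import re

# Assumed, deliberately loose patterns; real URLs and DOIs vary more than this.
URL_RE = re.compile(r"https?://[^\s<>\"')\]]+")
DOI_RE = re.compile(r"\b10\.\d{4,9}/[^\s<>\"')\]]+")

NO_SOURCES = "No sources found"


def extract_references(text):
    """Return URLs and DOIs found in text, in order, without duplicates."""
    refs = []
    for pattern in (URL_RE, DOI_RE):
        for match in pattern.findall(text):
            ref = match.rstrip(".,;:")  # drop trailing sentence punctuation
            if ref and ref not in refs:
                refs.append(ref)
    return refs


def ground_citations(model_output, context):
    """Split references in the model output into grounded and rejected.

    A reference is grounded only if the identical string was also
    extracted from the retrieval context (exact match, no fuzzy logic).
    """
    allowed = set(extract_references(context))
    candidates = extract_references(model_output)
    grounded = [r for r in candidates if r in allowed]
    rejected = [r for r in candidates if r not in allowed]
    return grounded, rejected


def render_sources(model_output, context):
    """Return grounded references one per line, or the fallback message."""
    grounded, _ = ground_citations(model_output, context)
    return "\n".join(grounded) if grounded else NO_SOURCES


context = "See https://example.org/paper-1 and doi 10.1234/abc.567 for details."
output = (
    "Sources: https://example.org/paper-1, "
    "https://example.org/fake-2, 10.1234/abc.567."
)
grounded, rejected = ground_citations(output, context)
```

In this example `https://example.org/fake-2` ends up in `rejected` because it never appears in the context, and calling `render_sources` with an empty context returns the fallback message. Comparing against references extracted from the context, rather than testing for a raw substring, is a design choice that keeps a truncated URL from passing as a match.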
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T19:09:44.200119+00:00— report_created — created