Report #93818
[research] Generating plausible but fake academic citations or URLs when asked for sources
Never generate a citation from memory; only cite documents provided in the context, and append the exact chunk/paragraph ID. If no context is provided, explicitly state inability to cite.
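A minimal sketch of how this rule might be enforced on the prompt side, assuming context arrives as chunks that already carry IDs. The names Chunk, build_grounded_prompt, and NO_CONTEXT_MESSAGE, and the "doc1#p3" ID scheme, are hypothetical and not taken from any particular library:

```python
from dataclasses import dataclass
from typing import Optional, Sequence

# Hypothetical fixed reply used when there is nothing to cite.
NO_CONTEXT_MESSAGE = "I cannot cite sources: no context documents were provided."


@dataclass(frozen=True)
class Chunk:
    chunk_id: str  # assumed ID scheme, e.g. "doc1#p3"
    text: str


def build_grounded_prompt(question: str, chunks: Sequence[Chunk]) -> Optional[str]:
    """Return a prompt restricted to the given chunks, or None if no context exists."""
    if not chunks:
        # No context: the caller should answer with NO_CONTEXT_MESSAGE
        # instead of asking the model for sources.
        return None
    lines = [
        "Answer using ONLY the context below.",
        "After every claim, append the supporting chunk ID in square brackets.",
        "Do not cite anything that is not in the context.",
        "",
        "Context:",
    ]
    lines += [f"[{c.chunk_id}] {c.text}" for c in chunks]
    lines += ["", f"Question: {question}"]
    return "\n".join(lines)
```

When build_grounded_prompt returns None, the caller would reply with NO_CONTEXT_MESSAGE rather than calling the model at all, which keeps the "inability to cite" path deterministic.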
Journey Context:
LLMs are trained to be helpful and will confidently invent a URL or DOI that matches the pattern of a real one. Retrieval-Augmented Generation (RAG) was supposed to fix this, but models still hallucinate citations when the retrieved context doesn't contain the answer. Strict citation grounding, which forces the model to quote only from the provided context, is the only proven mitigation. Without it, the model acts as a probabilistic URL generator.
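Because a model can still ignore grounding instructions, a post-generation check may help. The sketch below is one possible approach, not an established API: check_citations is a hypothetical helper that assumes citations appear as chunk IDs in square brackets and that context is a mapping from chunk ID to chunk text. The URL/DOI test is a plain substring match, so it is deliberately crude.

```python
import re
from typing import Dict, List

# Assumed citation format: a chunk ID in square brackets, e.g. [doc1#p3].
CITATION_RE = re.compile(r"\[([^\[\]]+)\]")
# Rough pattern for URLs and bare DOIs; not a full validator.
URL_OR_DOI_RE = re.compile(r"https?://\S+|\b10\.\d{4,9}/\S+", re.IGNORECASE)


def check_citations(answer: str, context: Dict[str, str]) -> List[str]:
    """Return a list of grounding problems; an empty list means none were found."""
    problems = []
    cited = CITATION_RE.findall(answer)
    if not cited:
        problems.append("no chunk ID cited")
    for chunk_id in cited:
        if chunk_id not in context:
            problems.append(f"unknown chunk ID: {chunk_id}")
    # Any URL or DOI in the answer must appear verbatim in the provided context.
    context_text = "\n".join(context.values())
    for link in URL_OR_DOI_RE.findall(answer):
        link = link.rstrip(".,;)]")  # drop trailing sentence punctuation
        if link not in context_text:
            problems.append(f"URL/DOI not in context: {link}")
    return problems


if __name__ == "__main__":
    ctx = {"doc1#p3": "See https://example.org/report for details."}
    print(check_citations("Details: https://example.org/fake-paper [doc9#p1]", ctx))
```

An answer that produces a non-empty list would be rejected or regenerated rather than shown to the user.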
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:03:43.494345+00:00— report_created — created