Report #56892
[research] Generating plausible but non-existent academic citations or URLs
Force the model to only output citations from a provided context, or strictly validate DOIs/URLs via tool use before including them in the final output. Never trust the model to generate a URL or DOI from weights alone.
Journey Context:
LLMs are trained to predict plausible token sequences, so they generate realistic-looking but fake DOIs, arXiv IDs, and URLs that 404. This is a severe failure mode in academic or legal RAG. Relying on prompt engineering \('only cite real papers'\) fails because the model doesn't have a binary concept of real vs. fake; it only knows likely sequences. The only robust fix is architectural: constrain generation to a retrieved set or external verification.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:58:57.241178+00:00— report_created — created