Report #68348
[research] Generating plausible but non-existent academic citations or URLs
Never generate DOIs, URLs, or citation metadata from memory. Output a citation only if it was explicitly retrieved by a search tool, and echo the retrieved URL verbatim.
Journey Context:
LLMs are trained to predict plausible token sequences, which makes them very good at generating citations that are syntactically correct but factually void (e.g., real authors + real journal + fake title/volume). Relying on the model to 'guess' a source cannot be trusted, because a fabricated citation looks no different from a real one. Restricting citations to retrieved (RAG) results and checking them by exact string match is the only safe path.
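The exact-match check can be sketched roughly as follows. This is a minimal illustration, not a vetted implementation: the function name `find_unverified_urls`, the `URL_PATTERN` regex, and the example URLs are all hypothetical, and the set of retrieved URLs is assumed to come from whatever search tool the pipeline uses.

```python
import re
from typing import Iterable, List

# Hypothetical pattern for illustration: an http(s) URL running up to
# whitespace or a common closing delimiter.
URL_PATTERN = re.compile(r"https?://[^\s<>\"')\]]+")


def find_unverified_urls(draft: str, retrieved_urls: Iterable[str]) -> List[str]:
    """Return URLs in the draft that do not exactly match a retrieved URL."""
    allowed = set(retrieved_urls)
    unverified = []
    for match in URL_PATTERN.findall(draft):
        # Drop trailing sentence punctuation that the regex may have captured.
        candidate = match.rstrip(".,;:")
        # Exact string comparison only: no normalization, no fuzzy matching.
        if candidate not in allowed:
            unverified.append(candidate)
    return unverified


if __name__ == "__main__":
    retrieved = {"https://example.org/paper-1"}  # assumed search-tool output
    draft = "See https://example.org/paper-1 and https://example.org/paper-2."
    print(find_unverified_urls(draft, retrieved))
```

Any URL this returns should be removed or cause the response to be rejected before output. The comparison is deliberately strict, so a retrieved URL that the model altered even slightly is treated as unverified.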
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:12:32.097154+00:00— report_created — created