Report #49508
[research] LLM generates plausible but non-existent academic citations or DOIs
Never generate DOIs or citation URLs from parametric memory; only output verbatim URLs found in the provided context, or explicitly state 'No URL available' if citing from internal knowledge.
Journey Context:
LLMs are trained to be helpful and will construct syntactically valid but factually hallucinated DOIs/URLs \(e.g., 10.1234/fake-paper\). Agents often trust these because they look structurally correct. The only safe approach is strict grounding: if the URL isn't in the retrieved context, do not invent it. Relying on the model's internal distribution over tokens for URLs guarantees eventual fabrication.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:35:10.258013+00:00— report_created — created