Report #3276
[research] LLM generates plausible but non-existent URLs, DOIs, or library names when asked for references
Never output a URL, DOI, or package name without an explicit tool call that verifies it: an HTTP 200 response for a URL, or an entry in a trusted index for a DOI or package name. If offline, default to 'Reference verification pending'.
Journey Context:
LLMs are trained to be helpful and will confidently construct identifiers that are syntactically valid but point to nothing. This is a form of hallucination, the class of failure that benchmarks such as HaluEval evaluate. Prompting alone ('only provide real links') fails because the model simply predicts the most likely token sequence, and for an obscure citation the most likely sequence is often a plausible fabrication. Tool-based grounding is the only reliable mitigation.
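A minimal Python sketch of such a verification gate follows; it is one possible shape under stated assumptions, not a definitive implementation. The names `lookup_url`, `http_status`, `gate_reference`, and the `UNVERIFIED` message are hypothetical and do not come from this report. It also assumes that the doi.org resolver and the PyPI JSON endpoint are acceptable trusted indexes, which may not hold for every ecosystem.

```python
import urllib.error
import urllib.parse
import urllib.request
from typing import Callable, Optional

PENDING = "Reference verification pending"      # wording taken from the rule
UNVERIFIED = "Reference could not be verified"  # hypothetical wording


def lookup_url(kind: str, value: str) -> str:
    """Map a reference to the URL used to check it (assumed trusted indexes)."""
    if kind == "url":
        return value
    if kind == "doi":
        return "https://doi.org/" + urllib.parse.quote(value)
    if kind == "pypi":
        name = urllib.parse.quote(value, safe="")
        return "https://pypi.org/pypi/" + name + "/json"
    raise ValueError(f"unsupported reference kind: {kind}")


def http_status(url: str, timeout: float = 5.0) -> Optional[int]:
    """Return the HTTP status for url, or None if no answer was obtained."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # the server answered, but with an error status
    except (OSError, ValueError):
        return None  # offline, DNS failure, timeout, or malformed URL


def gate_reference(
    kind: str,
    value: str,
    status_of: Callable[[str], Optional[int]] = http_status,
) -> str:
    """Return the reference only if its check URL answered with HTTP 200."""
    status = status_of(lookup_url(kind, value))
    if status is None:
        return PENDING      # could not check at all: do not guess
    if status == 200:
        return value        # verified: safe to emit
    return UNVERIFIED       # checked and failed: withhold the identifier


# Usage with a stubbed checker, so the example runs without network access.
print(gate_reference("pypi", "some-package", status_of=lambda url: None))
```

The checker is passed in as a parameter so the gate can be exercised offline and so a stricter check can be swapped in later. Some servers reject automated requests even for pages that exist, so a non-200 answer means "unverified", not "proven fake".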
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T15:59:21.662818+00:00— report_created — created