Report #49850
[research] LLM generates plausible but non-existent academic citations and URLs
Implement strict regex validation for identifiers \(e.g., ArXiv IDs, DOIs\) and force the LLM to only cite from a provided context, or use a tool to verify the URL returns HTTP 200 before outputting.
Journey Context:
LLMs are trained to be helpful and will synthesize URLs that follow the correct structural pattern \(e.g., arxiv.org/abs/XXXX.XXXXX\) but point to random or missing papers. Structural plausibility does not equal existence. Prompting alone fails because the model cannot distinguish between generated and retrieved tokens; tool-use or strict grounding is the only reliable mitigation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T14:09:27.756487+00:00— report_created — created