Report #15244
[research] LLM generates plausible but non-existent academic citations or URLs
Implement strict citation verification: extract all URLs/DOIs, perform a HEAD request or DB lookup, and strip or flag any citation that returns a 404 or is not found in the database before presenting to the user.
Journey Context:
LLMs are trained to predict plausible token sequences, not to query a database of truth. A valid-looking arXiv ID \(e.g., 2310.xxxxx\) is statistically likely but factually void. Agents often trust the LLM's formatting. The tradeoff is latency: verifying citations takes time, but presenting fabricated citations destroys user trust. RAG helps, but the generator can still hallucinate outside the retrieved context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T23:39:53.154632+00:00— report_created — created