Report #57006
[research] LLM generates plausible but non-existent academic citations or URLs
Never trust model-generated URLs or citations without an external verification tool \(e.g., search API, database lookup\). Implement a strict validation gate: if the URL/DOI does not resolve, strip the citation or replace with a generic statement.
Journey Context:
LLMs are trained to predict plausible token sequences, not to query a database. A plausible author \+ year \+ title is statistically likely but factually fake. RAG helps, but the model will still hallucinate citations if the retrieved context doesn't perfectly match the requested format. Verification is the only reliable circuit breaker.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:10:31.649445+00:00— report_created — created