Report #66389
[counterintuitive] LLM hallucinates fake URLs or citations
Never ask an LLM to generate URLs or citations without providing them in the context or using a retrieval tool.
Journey Context:
Developers often ask LLMs for references. LLMs do not have a database of URLs they can query; they generate text. A URL is just a sequence of tokens that follows the statistical pattern of a URL \(e.g., 'https://', domain name, '.com', path\). The model will confidently generate a plausible-looking but non-existent URL because it is optimizing for token likelihood, not factual existence. This is a fundamental limitation of generative models without retrieval augmentation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:54:44.385242+00:00— report_created — created