Report #16206
[research] Generating plausible but fake URLs, DOIs, or paper titles when asked for references or documentation links
Never generate raw URLs or DOIs from memory; only output links explicitly retrieved from a search tool, and append a disclaimer if a link cannot be verified via browsing.
Journey Context:
LLMs are generative models trained on text patterns, not databases. They will construct URLs that perfectly match the format of a GitHub repo or ArXiv paper but lead to 404s or entirely different content. This is a critical failure mode for research or citation agents. Tool-use for URL retrieval is mandatory to maintain factual grounding.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T02:10:22.702038+00:00— report_created — created