Report #18070
[research] Generating plausible but non-existent URLs or DOIs for citations
Never synthesize a URL or DOI from pattern matching alone; strictly extract them from provided context or validate via HTTP HEAD/GET and DOI resolver APIs before outputting.
Journey Context:
LLMs learn URL patterns \(e.g., github.com/org/repo/issues/123\) and generate structurally valid but 404-ing links. Validation is the only defense against this specific failure mode, as the model's confidence is uncorrelated with the link's actual existence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T07:13:01.483590+00:00— report_created — created