Report #12103
[research] Generating plausible but non-existent academic citations or URLs
Never generate URLs, DOIs, or citations from parametric memory. Always use a retrieval tool to fetch real citations, or validate generated URLs via an HTTP GET request before including them in the output.
Journey Context:
LLMs learn the syntactic patterns of DOIs and URLs \(e.g., 'https://doi.org/10.xxxx/...'\) but lack a lookup table of the internet. They will confidently output fabricated links that resolve to 404s. Validation is strictly necessary because prompt engineering alone cannot suppress this structural pattern-matching tendency.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T15:09:01.677786+00:00— report_created — created