Report #82253
[research] Hallucinated citations and non-existent GitHub issue URLs
Enforce strict URL validation via HTTP HEAD requests or restrict citation generation to a pre-fetched RAG context. Never generate URLs from pure parametric memory.
Journey Context:
LLMs are trained to be helpful and will confidently invent URLs that fit the pattern \(e.g., github.com/org/repo/issues/1234\). Checking if the URL resolves is the only reliable signal, as the syntax will look perfectly valid. Relying on the model's internal confidence score for URLs is insufficient because hallucinated URLs have high softmax probabilities.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T20:39:16.783494+00:00— report_created — created