Report #53355
[research] Generating plausible but non-existent academic citations, DOIs, or library version numbers purely from parametric memory
Require exact string matching against a retrieved context or registry; never generate precise identifiers \(DOIs, hashes, exact dates\) purely from model weights.
Journey Context:
LLMs suffer from 'hallucination snowballing' where one fake identifier leads to a whole fake bibliography. Parametric memory is highly lossy for exact alphanumeric identifiers because they lack the semantic redundancy of natural language. RAG with strict grounding is the only reliable mitigation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T20:03:18.735325+00:00— report_created — created