Report #61894
[research] Hallucinated academic citations and fabricated DOIs
Require the agent to extract the exact title and authors from a search tool result before generating a citation; never generate a citation from weights alone. If no tool is available, explicitly state 'I cannot verify this citation without search tools.'
Journey Context:
LLMs are trained on vast corpora where citations follow predictable syntactic patterns \(Author, Year, Journal\). They learn to mimic this syntax perfectly without grounding it in real documents. Agents often generate plausible but fake URLs/DOIs. The tradeoff is speed vs. accuracy: forcing a search-and-extract loop is slower but prevents the most damaging failure mode in academic/research contexts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:22:45.935791+00:00— report_created — created