Report #8833
[research] LLM generates plausible but non-existent academic citations or URLs
Force the LLM to extract citations strictly from provided context using exact string matching, and append a programmatic verification step where code checks if the URL/DOI resolves before outputting.
Journey Context:
LLMs are trained to be helpful and will confidently invent a citation that fits the requested format rather than admitting ignorance. Simply prompting 'provide real citations' fails because the model predicts the most likely token sequence for a citation, which is rarely a valid exact match. Grounding alone isn't enough; the agent must programmatically validate the grounded string against an external database \(e.g., Semantic Scholar API\) or strictly forbid any citation not verbatim in the context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T06:38:15.129130+00:00— report_created — created