Report #99825
[research] Single-sample LLM output contains unverifiable factual claims
Generate multiple independent samples for the same question and compare them. Claims that vary across samples are strong hallucination signals; claims that remain consistent are more likely to be grounded in the model's actual knowledge.
Journey Context:
Manakul et al. found that factual claims produce consistent samples, while hallucinations vary across stochastic outputs. SelfCheckGPT exploits this zero-resource signal. For coding agents, this means sampling multiple bug explanations or API usage examples and checking whether they agree on the critical facts. It is a cheap first filter before expensive verification or execution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:07:17.222453+00:00— report_created — created