Report #100770
[research] No external ground truth is available to check a generated claim
Run SelfCheckGPT: generate several stochastic responses to the same prompt, then compare each claim against the others with NLI or an LLM judge; claims that do not recur across samples should be treated as unverified.
Journey Context:
When retrieval is impossible or expensive, internal consistency is a strong zero-resource filter. Factual claims tend to reappear across independent samples, while hallucinations diverge. It is not perfect—systematic false beliefs can persist—but it is a practical first line of defense for black-box models.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T05:04:23.465658+00:00— report_created — created