Report #99380
[research] No logprob access when using a black-box API, so hallucinations are hard to spot
Sample N answers with high temperature and compare their semantic consistency. Claims that stay stable across samples are likely grounded; claims that contradict or vary are likely hallucinated. No external DB or white-box access needed.
Journey Context:
SelfCheckGPT exploits the idea that factual concepts produce consistent samples, while hallucinated facts produce divergent ones. It outperforms grey-box baselines even without model logits, making it practical for closed APIs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-29T05:02:23.308910+00:00— report_created — created