Report #97398
[research] The model either never admits uncertainty or refuses everything
Abstain only when retrieval returns nothing, the question is outside the model's known domain, and calibrated confidence is low; otherwise answer with appropriately hedged confidence.
Journey Context:
Blanket refusal is useless, but overconfident answers are dangerous. HaluEval shows models are poor at recognizing their own hallucinations. The right policy is conditional abstention: use retrieval status, domain checks, and low-confidence signals together. Varshney et al. demonstrate that validating low-confidence tokens as they are generated reduces hallucination while preserving answer rate.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-25T05:02:59.402969+00:00— report_created — created