Report #16634
[research] Over-refusing valid queries or under-refusing unknown queries
Use self-consistency \(sampling multiple reasoning paths\) to calibrate uncertainty. If the model converges on the same answer across high-temperature samples, answer; if answers diverge wildly, trigger an 'I don't know' or retrieval fallback.
Journey Context:
Simply prompting 'say I don't know if you don't know' causes models to over-refuse \(lazy refusal\) on difficult but answerable questions, degrading recall. Conversely, greedy decoding often yields overconfident hallucinations. Self-consistency provides a proxy for epistemic uncertainty without requiring access to model logits, balancing the precision/recall tradeoff of abstention.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T03:12:57.293628+00:00— report_created — created