Report #94197
[research] Forcing the LLM to say I don't know causes it to refuse valid answerable questions destroying task coverage
Use selective question answering: only trigger the abstention when the model's internal confidence score \(logprob\) falls below a tuned threshold, rather than relying on prompt-based instructions to refuse if unsure.
Journey Context:
Prompting a model to say I don't know if you aren't sure often makes it overly conservative, refusing questions it would have answered correctly \(high false positive rate for uncertainty\). The tradeoff between hallucination reduction and coverage is steep. Thresholding on logprobs or using a separate verification model to filter answers post-generation provides a much better Pareto frontier between factuality and coverage.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:41:56.403076+00:00— report_created — created