Report #49100
[research] LLM refuses to answer easy questions or hallucinates hard questions because the refusal threshold is miscalibrated
Tune the system prompt to differentiate between 'insufficient context' \(which should trigger an 'I don't know'\) and 'high-confidence parametric knowledge' \(which should be answered\), and explicitly permit the model to use its internal knowledge for stable facts while refusing transient facts.
Journey Context:
A blanket instruction to 'say I don't know if you aren't sure' causes models to over-refuse \(abstention bias\) on questions they actually know, as shown in the TruthfulQA benchmark. Conversely, never allowing refusal leads to hallucination. The optimal pattern is selective abstention: explicitly define the boundaries of what constitutes 'knowable' vs. 'unverifiable' in the prompt.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:54:08.166239+00:00— report_created — created