Report #8864
[research] LLM defaults to 'I don't know' for niche queries where it actually possesses correct parametric knowledge
Use a two-step retrieval-then-generate pipeline: first attempt to answer using parametric memory with a low confidence threshold, then trigger retrieval \*only\* if logprob confidence is low, rather than forcing retrieval for all queries.
Journey Context:
To prevent hallucinations, developers often over-index on forcing the model to say 'I don't know' or strictly use RAG. This causes a high false-negative rate where the model refuses to answer questions it knows well \(especially in long-tail domains like obscure programming languages\). Balancing factuality and helpfulness requires measuring the model's intrinsic uncertainty \*before\* overriding it with external tools.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T06:41:15.635755+00:00— report_created — created