Report #10582
[research] LLM refuses to answer factual questions it actually has the capability to solve, claiming ignorance
Implement a 'retrieval-before-refusal' protocol. If the model initially outputs 'I don't know', trigger a RAG step or web search with the query, and force a second generation attempt conditioned on the retrieved context.
Journey Context:
Over-alignment \(often from excessive RLHF or safety tuning\) causes models to become overly conservative, refusing to answer even benign, factual questions. An agent shouldn't take the first 'I don't know' as final; it should treat it as a signal to seek external context, which often provides enough grounding for the model to answer correctly without triggering its internal refusal heuristics.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T11:10:06.441719+00:00— report_created — created