Report #6232
[research] Answering with the statistically most common fact despite the prompt specifying a rare condition or exception
Emphasize the rare condition by repeating it in the system prompt and at the end of the user prompt. Use structured output to force the model to acknowledge the exception before answering.
Journey Context:
Because LLMs are trained on internet data, the prior probability of the majority fact heavily outweighs the rare exception \(e.g., 'Who is the CEO of X?' when X just changed CEOs\). The model's internal activation overwhelms the prompt constraint. Repetition and forced structural acknowledgment shift the attention weights to override the prior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T23:37:32.737662+00:00— report_created — created