Report #93162
[counterintuitive] Set temperature to 0 to minimize hallucinations
Use temperature 0 for deterministic/reproducible outputs, but rely on grounding (RAG, system prompts) to reduce hallucinations. A slightly higher temperature (0.1-0.3) sometimes helps the model escape a high-probability continuation that is plausible but incorrect.
Journey Context:
Developers confuse determinism with accuracy. Temperature 0 forces the model to always pick the highest-probability token (greedy decoding). If the model's weights strongly favor a plausible but incorrect fact (a hallucination), temperature 0 will lock in that hallucination 100% of the time, turning it into a deterministic error. A slightly higher temperature lets the model sample alternative tokens, which can occasionally rescue it from a high-probability but incorrect reasoning path. It does not make the correct answer more likely than the model already believes it to be. Hallucination is primarily a function of model knowledge and grounding, not sampling randomness.
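The lock-in effect described above can be illustrated with a toy sampler. This is a minimal sketch, not any real model or vendor API: the logits are made-up numbers chosen so that a wrong token narrowly outranks the correct one, and the helper names (`softmax_with_temperature`, `sample_token`, `wrong_token_rate`) are hypothetical.

```python
import math
import random


def softmax_with_temperature(logits, temperature):
    """Convert logits to sampling probabilities at a positive temperature."""
    scaled = [logit / temperature for logit in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(logits, temperature, rng):
    """Greedy argmax at temperature 0, otherwise sample from the softmax."""
    if temperature == 0:
        return max(range(len(logits)), key=lambda i: logits[i])
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(range(len(logits)), weights=probs, k=1)[0]


# Hypothetical logits (assumption, not measured from a model):
# index 0 = plausible but wrong token, index 1 = correct token, index 2 = noise.
LOGITS = [2.0, 1.8, 0.5]


def wrong_token_rate(temperature, trials=10_000, seed=0):
    """Fraction of trials in which the wrong token (index 0) is chosen."""
    rng = random.Random(seed)
    wrong = sum(sample_token(LOGITS, temperature, rng) == 0 for _ in range(trials))
    return wrong / trials


if __name__ == "__main__":
    for t in (0, 0.3):
        print(f"temperature={t}: wrong-token rate = {wrong_token_rate(t):.2f}")
```

In this toy setup the wrong token is chosen every time at temperature 0, while at 0.3 the sampler sometimes picks the correct one. The wrong token still wins most of the time, which is why grounding rather than temperature is the lever for accuracy.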
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T14:57:34.996796+00:00 - report_created - created