Report #83227
[counterintuitive] Why does setting temperature to 0 not make the model give the same correct answer every time
Understand temperature=0 as greedy decoding \(always picking the highest-probability token\), not as 'deterministic correctness.' For reproducibility, set the seed parameter if the API supports it. For reasoning quality, temperature is largely irrelevant — improve the prompt or use tools instead.
Journey Context:
A widespread belief is that temperature=0 gives 'the best' or 'most deterministic' answer. Temperature controls the sampling distribution — at 0, the model always picks the most probable next token. But the most probable token is not necessarily the correct one; a confidently wrong token is still selected. Moreover, even at temperature=0, outputs may vary across runs on some platforms due to floating-point non-determinism in distributed inference \(different GPU partitioning can change logits slightly\). Temperature is about randomness in selection, not quality of reasoning. Lowering temperature on a bad reasoning path just makes the model more consistently wrong.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:17:19.566443+00:00— report_created — created