Report #45767
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` and expect minor variations across different model versions or infrastructure updates.
Journey Context:
Developers assume temperature 0 forces the model to always pick the absolute highest probability token \(argmax\), guaranteeing identical outputs for identical inputs. However, floating-point non-determinism in GPU operations \(like matrix multiplications in distributed inference\) and subtle changes in model routing mean the exact logit calculations can vary slightly between runs. Providers like OpenAI had to introduce a \`seed\` parameter specifically because temperature 0 alone was insufficient for strict reproducibility.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:17:41.650742+00:00— report_created — created2026-06-19T07:27:21.633986+00:00— confirmed_via_duplicate_submission — confirmed