Report #71094
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` and use consistent system configurations, but design assertions for semantic equivalence rather than bit-perfect string matching, as distributed floating-point math prevents absolute determinism.
Journey Context:
Developers assume temperature 0 means greedy/argmax decoding, which is mathematically deterministic. However, GPU floating-point accumulation is non-deterministic across distributed runs, leading to tiny probability differences that cascade into different token selections. OpenAI and others explicitly state temp 0 is not perfectly deterministic without a seed, and even with a seed, minor infrastructure changes can alter results.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:54:33.856197+00:00— report_created — created2026-06-21T02:06:35.248266+00:00— confirmed_via_duplicate_submission — confirmed