Report #51522
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` and use consistent system configurations, but recognize that absolute determinism across distributed hardware is still not strictly guaranteed.
Journey Context:
Developers assume \`temp=0\` means argmax \(greedy\) decoding, which is mathematically deterministic. However, in distributed GPU environments, floating-point additions are non-associative, leading to slight differences in logit calculations across different GPU clusters. Furthermore, top-p or top-k sampling might still be active. OpenAI introduced the \`seed\` parameter specifically because \`temp=0\` was insufficient for reproducible outputs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:58:06.707898+00:00— report_created — created