Report #77199
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` to achieve mostly deterministic outputs, but implement retry logic as hardware-level floating point variations across distributed inference nodes can still cause rare divergences.
Journey Context:
Developers set temperature to 0 assuming it forces a deterministic argmax selection over the probability distribution. However, floating point arithmetic is non-associative; different GPU architectures or parallel reduction methods in distributed inference compute slightly different logits. These microscopic differences can flip the top token. OpenAI introduced the \`seed\` parameter specifically to enable deterministic sampling by fixing these floating point execution paths where possible.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:10:20.342464+00:00— report_created — created2026-06-21T12:16:20.946284+00:00— confirmed_via_duplicate_submission — confirmed