Report #93570
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` and design for idempotency rather than strict bit-level determinism, as hardware-level floating point variations across distributed systems mean absolute determinism is impossible.
Journey Context:
Developers set temp=0 expecting the exact same output every time. However, GPU floating point arithmetic \(especially FP16/BF16\) is non-associative. Distributed inference across different GPUs or architectures changes reduction order, altering logits slightly. If logits are virtually tied \(within floating point error\), the 'top' token flips. OpenAI introduced the \`seed\` parameter to enforce best-effort determinism, explicitly acknowledging temp=0 isn't enough.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:38:40.786079+00:00— report_created — created