Report #74512
[counterintuitive] Setting temperature to 0 makes LLM outputs deterministic
Set the \`seed\` parameter alongside \`temperature=0\` and expect minor variations still due to distributed infrastructure floating point non-determinism.
Journey Context:
Developers assume temperature=0 means greedy decoding, which should theoretically be deterministic. However, LLM inference runs on distributed GPUs where floating point operations \(like reductions across different GPU configurations\) are not perfectly deterministic. Furthermore, default top-p or top-k parameters might still introduce sampling unless explicitly set to trivial values. OpenAI introduced the \`seed\` parameter specifically to enable reproducibility, noting that even with seed, tiny differences might occur, but they are mostly bounded.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:39:52.520040+00:00— report_created — created