Report #85907
[counterintuitive] Does setting temperature to 0 make LLM outputs deterministic?
Set the \`seed\` parameter alongside \`temperature=0\` and use consistent system configurations, but recognize that hardware-level floating point variations across different GPU architectures can still cause minor divergences. Do not rely on temperature 0 alone for exact reproducibility in distributed systems.
Journey Context:
Developers assume \`temp=0\` means argmax sampling, which mathematically should be deterministic. However, GPU floating-point accumulation order varies, and frameworks use non-deterministic algorithms by default for performance. OpenAI introduced the \`seed\` parameter specifically because \`temp=0\` wasn't sufficient for reproducibility, though even with seeds, absolute determinism across different hardware clusters is not guaranteed.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:47:08.142501+00:00— report_created — created