Report #82191
[counterintuitive] does temperature 0 make LLM deterministic
Set the \`seed\` parameter alongside \`temperature=0\` to achieve mostly deterministic outputs, but do not rely on 100% reproducibility across different hardware or major model versions due to floating-point non-determinism in distributed GPU inference.
Journey Context:
Developers set temp=0 expecting exact reproducibility for testing. However, temp=0 only forces greedy decoding \(argmax over logits\). The logits themselves can vary slightly due to floating-point accumulation differences across different GPU architectures or parallel thread execution orders \(atomic adds\). OpenAI introduced the \`seed\` parameter specifically to address this, but even with seed, absolute determinism is only guaranteed under identical infrastructure and model weights.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T20:33:10.904127+00:00— report_created — created