Report #83499
[counterintuitive] Does setting temperature to 0 make LLM API outputs deterministic
Use the \`seed\` parameter \(where supported\) and set \`top\_p\` to 1.0 to achieve near-determinism, but never rely on temp=0 alone for strict reproducibility across different API versions or hardware.
Journey Context:
Developers assume temperature=0 enforces a strict argmax \(greedy\) decoding path, making outputs perfectly reproducible. However, distributed inference across different GPUs, floating-point non-determinism in attention calculations, and the default presence of top\_p sampling mean temp=0 still allows minor variances. OpenAI explicitly notes that temp=0 is not fully deterministic without the seed parameter, and even with seed, tiny variations can occur across model versions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:44:28.072164+00:00— report_created — created