Report #69564
[counterintuitive] Setting temperature to 0 makes model outputs deterministic and reproducible
If you need deterministic outputs, use the seed parameter \(where available\) and log outputs for replay. Accept that temperature=0 only reduces variance—it does not guarantee identical outputs across calls. For reproducibility, store and replay rather than regenerate.
Journey Context:
Temperature=0 selects the highest-probability token at each step, but GPU floating-point operations are non-deterministic across different hardware and batch configurations. The softmax computation over logits can produce slightly different rankings on different runs, leading to different token selections. OpenAI explicitly documents this and provides a seed parameter to improve \(not guarantee\) reproducibility. This matters for testing, CI/CD pipelines, and any system that assumes identical outputs given identical inputs. The widespread belief that temperature=0 equals deterministic causes subtle bugs in evaluation pipelines and reproducibility claims.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T23:14:58.086025+00:00— report_created — created