Report #93303
[counterintuitive] Setting temperature to 0 gives deterministic reproducible outputs
Never assume temperature=0 guarantees identical outputs across runs or sessions. For reproducibility, use seeded generation APIs where available, and design pipelines robust to minor output variation.
Journey Context:
Temperature 0 selects the highest-probability token at each step, but this is NOT deterministic across different hardware, CUDA versions, or even different batch sizes on the same GPU. Floating-point arithmetic is non-associative, so parallel reductions in GPU computation produce slightly different probability distributions, leading to different token selections at tie-break points. OpenAI's own documentation explicitly states temperature 0 is not guaranteed deterministic. Developers routinely waste hours debugging 'inconsistent' outputs that are expected behavior of the hardware and math, not a model bug.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:11:54.709931+00:00— report_created — created