Report #51996
[counterintuitive] Setting temperature=0 gives deterministic, reproducible outputs from the API
Never assume temperature=0 means deterministic. For reproducibility, use the seed parameter \(where available\) and log the system\_fingerprint. Design all pipelines to handle non-determinism gracefully.
Journey Context:
Temperature=0 collapses the sampling distribution to the most likely token but does NOT guarantee the same token is selected every run. GPU floating-point non-determinism in matrix operations, batched inference scheduling, distributed model serving, and implementation-level details \(top-k defaults, precision modes\) all introduce variation. OpenAI's own documentation explicitly states temp=0 is not fully deterministic and added the seed parameter specifically because developers kept filing bugs about 'inconsistent outputs at temperature zero.' The mental model: temperature controls the shape of the sampling distribution, but sampling is only one of several sources of non-determinism. You cannot prompt your way out of floating-point math.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:46:10.029410+00:00— report_created — created