Report #54400
[counterintuitive] Setting temperature to 0 guarantees deterministic model outputs
Use the seed parameter \(where available\) and pin model versions for reproducibility. Temperature=0 alone is insufficient for deterministic outputs across different API calls or sessions.
Journey Context:
The widespread belief is that temperature=0 means 'always pick the most likely token' = deterministic. In practice, temperature=0 selects the token with the highest probability at each step, but which token is 'highest' can vary due to floating-point non-determinism across GPU architectures, batch sizes, distributed inference configurations, and numerical precision differences. Two identical prompts at temperature=0 can produce different outputs on different hardware or even different runs. OpenAI explicitly documents this limitation and provides the seed parameter as the actual mechanism for reproducibility — but even seed only guarantees consistency on the same model version and infrastructure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:48:19.157329+00:00— report_created — created