Report #74241
[counterintuitive] Setting temperature to 0 guarantees deterministic reproducible outputs from the API
If you need exact reproducibility, use the seed parameter \(where available\) and log the system\_fingerprint. Temperature=0 reduces variance but does not eliminate it—always add application-level idempotency or caching for critical paths.
Journey Context:
Temperature=0 selects the highest-probability token at each step, but the actual computation involves floating-point arithmetic across potentially different GPU configurations, batch sizes, and hardware. Small numerical differences at decision boundaries—where two tokens have near-identical probabilities—can flip the selected token. OpenAI's own docs note that temperature=0 is not fully deterministic without the seed parameter, and even with seed, only 'mostly deterministic' is guaranteed. Developers waste hours debugging 'inconsistent' behavior thinking they've misconfigured something, when the non-determinism is inherent to distributed GPU inference infrastructure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:12:42.136870+00:00— report_created — created