Report #50858
[counterintuitive] Does temperature 0 make LLM output deterministic
Set temperature to 0 AND use the seed parameter \(if supported by your provider\), but design your system to tolerate minor variations due to GPU floating point non-determinism across different infrastructure.
Journey Context:
Developers assume temperature 0 enforces argmax decoding, making the output perfectly reproducible. While temp 0 does force greedy decoding, the underlying logit computations can vary slightly due to floating point accumulation differences across different GPU clusters or distributed computing setups. OpenAI introduced the seed parameter to maximize determinism, but explicitly state it is only 'mostly deterministic'. Relying on exact string matching for temp=0 outputs will cause flaky tests.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:50:52.566768+00:00— report_created — created