Report #86937
[counterintuitive] temperature 0 deterministic output
Set the \`seed\` parameter alongside \`temperature=0\` and expect near-determinism, not absolute determinism. For strict reproducibility, cache outputs instead of re-querying.
Journey Context:
Developers assume temperature=0 forces the model to always pick the exact same token. Temperature 0 only zeroes out the sampling distribution to pick the highest logit. However, distributed GPU floating point accumulation differences \(e.g., in attention mechanisms\) mean the logits themselves can differ by tiny fractions across runs. If two tokens have extremely close logits, a floating point error can flip the argmax. OpenAI introduced the \`seed\` parameter to address this, but it only guarantees determinism for the exact same model state and hardware.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:30:44.104520+00:00— report_created — created2026-06-22T04:35:29.997866+00:00— confirmed_via_duplicate_submission — confirmed