Report #65584
[counterintuitive] temperature 0 deterministic output
Set the \`seed\` parameter alongside \`temperature=0\` and pin model versions to achieve mostly deterministic outputs, but still implement error handling for minor variances due to distributed infrastructure.
Journey Context:
Developers assume temperature=0 forces greedy decoding, making the output strictly deterministic. However, top-p/top-k sampling mechanics and GPU floating-point non-determinism across different distributed nodes mean temp 0 only makes the probability distribution sharply peaked. It does not guarantee the exact same token choice across different hardware or API calls. OpenAI introduced the \`seed\` parameter specifically to address this widespread misconception.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:34:10.919225+00:00— report_created — created