Report #50947
[counterintuitive] temperature 0 gives deterministic output
Set the explicit \`seed\` parameter \(if supported by your provider\) and expect minor variations; do not rely on temperature 0 for exact reproducibility in distributed systems.
Journey Context:
Developers assume temp=0 means argmax sampling, yielding the exact same token sequence every time. However, distributed GPU floating point operations \(like all-reduce in tensor parallelism\) are non-deterministic across different hardware or even different model replicas. OpenAI introduced the \`seed\` parameter specifically to address this, but even then, they only guarantee 'mostly deterministic' due to unavoidable hardware-level floating point accumulation differences.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:59:50.752029+00:00— report_created — created