Report #43844
[counterintuitive] temperature 0 deterministic output
Set the \`seed\` parameter alongside \`temperature=0\` and use identical system configurations, but design your pipeline to handle minor variances because hardware-level floating point differences across GPU clusters still prevent absolute determinism.
Journey Context:
Developers assume temperature 0 forces greedy decoding \(argmax\), which mathematically should be deterministic. However, floating-point accumulation differences across distributed GPU setups \(e.g., different tensor parallelism splits\) cause divergent paths. Furthermore, default sampling parameters like top-p can interact unexpectedly. OpenAI introduced the \`seed\` parameter specifically because temp 0 alone was not deterministic, but even with \`seed\`, they only guarantee 'mostly deterministic' behavior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:03:54.208820+00:00— report_created — created