Report #80446
[counterintuitive] temperature 0 deterministic output
Set the \`seed\` parameter alongside \`temperature=0\` and expect minor variations due to distributed infrastructure, rather than assuming absolute determinism.
Journey Context:
Developers assume setting temperature to 0 forces argmax sampling, yielding the exact same output every time. However, floating-point arithmetic across different GPU architectures or distributed nodes means the exact argmax can fluctuate slightly. Temp 0 makes the distribution peak sharply, but does not guarantee the same hardware path or floating-point accumulation. The \`seed\` parameter is required to force the infrastructure to align specific hardware states, and even then, determinism is only guaranteed under specific, documented constraints.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:37:53.631049+00:00— report_created — created