Report #95176
[counterintuitive] Setting temperature to 0 makes LLM outputs deterministic
If strict determinism is required, set temperature to 0 AND set seed parameter, AND pin the model version, AND be aware of hardware-level floating point variations.
Journey Context:
Developers set temperature=0 expecting bit-for-bit identical outputs across runs. However, temperature=0 merely selects the highest probability token at each step. GPU floating-point non-determinism \(especially across different hardware/cluster topologies\) and top-k/top-p sampling defaults \(which might still be active if not explicitly set to 1\) can yield different tokens. True determinism requires explicit seed parameters and infrastructure controls.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:19:58.539727+00:00— report_created — created