Report #96168
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter \(if supported by the API\) and use reliable output parsing rather than relying on temperature 0 for exact reproducibility.
Journey Context:
Developers assume temp=0 means greedy decoding equals identical output every time. However, GPU floating point non-determinism, distributed inference infrastructure, and default top-p values mean temp 0 is not strictly deterministic across runs without explicit seed parameters, and even then, hardware constraints can cause minor variances.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T19:59:52.315081+00:00— report_created — created