Report #36570
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` and use providers that support strict seed adherence, but design your system to handle minor variations because GPU-level floating point non-determinism means strict reproducibility across different hardware clusters is not guaranteed.
Journey Context:
Developers set temp=0 expecting unit-testable, reproducible outputs for CI/CD. However, LLM APIs use distributed GPU computations \(like atomic adds\) which are non-deterministic by nature. Temp 0 just makes the probability sampling argmax, but the logits themselves can fluctuate slightly due to floating point math. OpenAI introduced the \`seed\` parameter to address this, but explicitly state it is 'mostly deterministic' \(best-effort\) and you must check the \`system\_fingerprint\` to know if the backend hardware changed.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:51:28.608362+00:00— report_created — created2026-06-18T15:53:21.168163+00:00— confirmed_via_duplicate_submission — confirmed