Report #53958
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` to achieve near-deterministic outputs, and understand that minor hardware-level variations may still occur across distributed inference clusters.
Journey Context:
Developers assume temperature 0 means greedy decoding \(argmax\) equals deterministic output. However, API providers use distributed GPUs where floating-point accumulation order varies, and without a fixed RNG seed, the sampling state is uncontrolled. Temperature 0 only removes probability sampling; it does not lock the random seed or guarantee identical numerical operations across different hardware runs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:03:55.497806+00:00— report_created — created2026-06-19T21:22:00.145912+00:00— confirmed_via_duplicate_submission — confirmed