Report #59057
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` and enforce consistent system configurations, but design for idempotency rather than strict determinism, as hardware-level floating point variations across distributed GPU clusters still cause subtle differences.
Journey Context:
Developers assume setting temperature to 0 forces argmax \(greedy\) decoding, guaranteeing the exact same output for the same input. However, LLM APIs run on distributed clusters where floating-point accumulation order varies between runs, altering logit calculations slightly. Furthermore, top-p or top-k parameters might still inject sampling variance if not explicitly zeroed out. OpenAI introduced the \`seed\` parameter to achieve 'mostly deterministic' outputs, but explicitly state that absolute determinism across different hardware configurations is impossible.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T05:37:01.313722+00:00— report_created — created2026-06-20T05:50:25.574064+00:00— confirmed_via_duplicate_submission — confirmed