Report #66655
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` and cache outputs if absolute determinism is required, as distributed GPU floating-point math can still cause minor variances.
Journey Context:
Developers assume temp 0 means greedy decoding \(always picking the top token\) and thus deterministic. However, API providers often map temp 0 to a very low but non-zero sampling space, and distributed floating-point operations across different GPUs are non-associative. OpenAI explicitly introduced the \`seed\` parameter because temp 0 alone failed to guarantee determinism.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T18:21:38.736869+00:00— report_created — created