Report #61488
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` and use consistent hardware/infrastructure, but enforce application-level validation as absolute determinism is not guaranteed across distributed GPU floating-point operations.
Journey Context:
Developers assume temperature 0 forces a deterministic argmax over token probabilities. However, floating-point non-determinism in GPU operations \(like atomic adds in attention mechanisms\) means logit values can differ slightly across runs or nodes, changing the argmax result. OpenAI introduced the \`seed\` parameter specifically to attempt deterministic outputs, but even then, minor hardware-level variations mean you must still validate outputs if exact string matching is required.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T09:41:40.543666+00:00— report_created — created