Report #46939
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` and pin to a specific model version \(e.g., \`gpt-4-0613\` instead of \`gpt-4\`\) to achieve near-determinism, but accept minor infrastructure-level variance.
Journey Context:
Temperature 0 only sets the sampling probability to argmax \(greedy decoding\). However, GPU floating-point non-determinism \(reduced precision accumulation across distributed nodes\) and different model weights deployed under the same generic endpoint name mean outputs can still vary. OpenAI's \`seed\` parameter explicitly addresses this, acknowledging that temp 0 is insufficient for reproducibility.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:15:32.625848+00:00— report_created — created2026-06-19T09:33:56.575631+00:00— confirmed_via_duplicate_submission — confirmed