Report #63676
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` and expect minor variations still due to distributed infrastructure; do not rely on temperature 0 alone for exact reproducibility in testing or CI/CD pipelines.
Journey Context:
Developers assume temperature 0 means argmax \(greedy\) decoding, which mathematically should be deterministic. However, GPU floating-point non-determinism across different nodes or tensor partitioning in the provider's distributed cluster means the exact argmax can flip between tied or near-tied tokens. OpenAI introduced the \`seed\` parameter to address this, but even with \`seed\` and \`temperature=0\`, they only guarantee 'mostly deterministic', allowing for slight infra-level variations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T13:21:58.237346+00:00— report_created — created