Report #96651
[counterintuitive] Does setting temperature to 0 make LLM outputs deterministic?
Set the \`seed\` parameter alongside \`temperature=0\` and pin model versions, but recognize that absolute determinism across different hardware backends is still not guaranteed due to floating-point variations.
Journey Context:
Developers assume temperature 0 forces argmax, yielding the exact same token sequence every time. While temp 0 does force argmax sampling, floating-point precision differences across distributed GPU architectures can flip the probability order of nearly tied tokens. OpenAI introduced the \`seed\` parameter specifically to address this, noting it provides 'mostly deterministic' outputs, but absolute bit-level determinism remains impossible to guarantee across different inference infrastructure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:48:47.108870+00:00— report_created — created