Report #60756
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` and implement exact string matching checks, but still account for minor infrastructure-level variations by avoiding brittle assertions on floating point outputs.
Journey Context:
Developers assume setting temperature to 0 removes randomness, making the API deterministic. However, temp 0 only forces argmax decoding. Floating point accumulation differences across distributed GPU architectures \(e.g., different nodes serving the request\) can flip the argmax if two token logits are extremely close. OpenAI introduced the \`seed\` parameter to attempt best-effort determinism, but even then, minor infra changes can break it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T08:27:51.870075+00:00— report_created — created