Report #57053
[counterintuitive] Is temperature 0 deterministic?
Set the `seed` parameter (where supported by the API) and use deterministic backends for exact reproducibility; do not rely on `temperature=0`.
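One way to act on this advice is to measure reproducibility rather than assume it. The sketch below is a minimal harness, not part of any API: `generate` is a hypothetical callable that you would write to wrap your own client call (with `seed` set where the API supports it), and `fake_generate` is a stand-in so the example runs offline.

```python
from collections import Counter
from typing import Callable


def count_distinct_outputs(
    generate: Callable[[str], str], prompt: str, runs: int = 5
) -> Counter:
    """Call generate() repeatedly with the same prompt and tally each distinct output."""
    return Counter(generate(prompt) for _ in range(runs))


def is_reproducible(
    generate: Callable[[str], str], prompt: str, runs: int = 5
) -> bool:
    """Return True only if every run produced the identical string."""
    return len(count_distinct_outputs(generate, prompt, runs)) == 1


# Hypothetical stand-in for a real model call; replace with your own wrapper.
def fake_generate(prompt: str) -> str:
    return prompt.upper()


stable = is_reproducible(fake_generate, "hello")
print("reproducible:", stable)
```

Running the same check against a real endpoint, with and without a seed, shows how much variation remains in your particular setup.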
Journey Context:
A widespread belief is that setting temperature to 0 forces the model to always pick the highest-probability token, yielding deterministic outputs. Greedy selection itself is deterministic, but the logits it operates on are not: floating-point addition is not associative, so differences in GPU architecture, distributed inference nodes, and batch size can change the order of operations and shift the computed logits slightly. When two candidate tokens are nearly tied, argmax over slightly different logits can select a different token, and the rest of the generation can diverge from that point. Temperature 0 minimizes randomness but does not guarantee determinism.
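The effect can be reproduced on a CPU with plain Python floats. This is a toy illustration, not a model of any real inference stack: the tokens "A" and "B" and their per-term contributions are made-up values, chosen so that both logits are mathematically equal and only the summation order decides the winner.

```python
def accumulate(terms):
    """Add terms left to right with plain float addition (no compensation)."""
    total = 0.0
    for t in terms:
        total += t
    return total


def greedy_pick(contributions, reverse=False):
    """Compute one logit per token, then take the argmax as temperature 0 would."""
    logits = {}
    for token, terms in contributions.items():
        # Reversing the terms stands in for a different reduction order,
        # e.g. a different kernel, batch size, or device layout.
        ordered = list(reversed(terms)) if reverse else terms
        logits[token] = accumulate(ordered)
    return max(logits, key=logits.get), logits


# Hypothetical contributions: the same three numbers, listed in opposite order.
contributions = {"A": [0.1, 0.2, 0.3], "B": [0.3, 0.2, 0.1]}

pick_fwd, logits_fwd = greedy_pick(contributions)
pick_rev, logits_rev = greedy_pick(contributions, reverse=True)

print(pick_fwd, logits_fwd)
print(pick_rev, logits_rev)
```

The two runs select different tokens even though no sampling is involved. The explicit loop is deliberate: the built-in `sum()` uses compensated summation for floats in recent Python versions and would hide the effect.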
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:15:01.381935+00:00— report_created — created