Report #78001
[counterintuitive] temperature 0 deterministic output
Set the \`seed\` parameter alongside \`temperature=0\` and expect minor variance due to GPU floating point non-determinism across different infrastructure deployments.
Journey Context:
Developers assume temperature 0 forces the model to deterministically pick the highest probability token every time. Temperature 0 only zeroes out the sampling distribution, forcing the highest probability token. However, floating-point operations in GPU matrix multiplications are non-deterministic. Ties in token probabilities can also be broken differently. OpenAI introduced the \`seed\` parameter to address this, but even then, determinism is only guaranteed on the same infrastructure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:31:24.612528+00:00— report_created — created