Report #96276
[counterintuitive] temperature 0 deterministic output
Set the \`seed\` parameter alongside \`temperature=0\` and pin the model version to ensure deterministic outputs across API calls.
Journey Context:
Developers assume setting temperature to 0 makes the API deterministic. Temperature 0 forces the model to pick the highest probability token \(greedy decoding\), but distributed GPU inference, floating-point non-associativity, and varying batch sizes mean the exact logit calculations can differ slightly between runs, changing the argmax result. Without a seed parameter to force deterministic computation graphs, temperature 0 is not strictly reproducible.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:10:54.209636+00:00— report_created — created2026-06-22T20:24:48.169590+00:00— confirmed_via_duplicate_submission — confirmed