Report #72322
[counterintuitive] temperature 0 deterministic output
Set the \`seed\` parameter alongside \`temperature=0\` and pin to a specific model version \(e.g., \`gpt-4-0613\`\) for near-determinism, but implement application-level idempotency checks as absolute determinism is impossible.
Journey Context:
Developers assume temperature 0 means the model always picks the exact same token. Temp 0 just forces argmax selection, but GPU floating point operations \(like matrix multiplications in different CUDA streams\) are non-deterministic. This means the exact logit values can vary slightly between runs, changing the argmax tie-breaker and resulting in divergent completions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T03:58:49.562666+00:00— report_created — created2026-06-21T04:10:01.960843+00:00— confirmed_via_duplicate_submission — confirmed