Report #50068
[counterintuitive] Setting temperature to 0 for deterministic LLM outputs
Set the \`seed\` parameter \(where supported\) alongside \`temperature=0\`, but acknowledge that minor hardware-level variations can still occur across distributed deployments.
Journey Context:
Developers set \`temperature=0\` expecting bit-perfect reproducibility. However, GPU floating point accumulation is non-associative, meaning distributed tensor parallelism across different GPU topologies can yield slightly different logits, causing different token selections even at temp 0. API providers added \`seed\` parameters to enforce determinism at the system level, but it is not strictly guaranteed by the math alone.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T14:31:29.739531+00:00— report_created — created