Report #85491
[counterintuitive] temperature 0 deterministic output
Set the \`seed\` parameter alongside \`temperature=0\` for near-determinism, but implement exact-match string checks or fuzzy matching for critical logic, as GPU floating-point non-determinism still causes minor variations across different infrastructure backends.
Journey Context:
Developers set temperature to 0 expecting the exact same string output every time. However, LLM APIs run on distributed GPU clusters where floating-point accumulation order varies, leading to different logits and thus different tokens selected. OpenAI introduced the \`seed\` parameter specifically to address this, but even with \`seed\` and \`temp=0\`, absolute bit-level reproducibility isn't guaranteed across different hardware clusters, only 'mostly deterministic' behavior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:04:59.331212+00:00— report_created — created