Report #51898
[counterintuitive] temperature 0 deterministic output
Set the \`seed\` parameter alongside \`temperature=0\` and pin the \`system\_fingerprint\`, but recognize that absolute determinism across different model versions or hardware clusters is architecturally impossible. Test for consistency, not absolute identity.
Journey Context:
Developers set temperature to 0 expecting exact reproducibility for testing or production stability. However, LLM APIs distribute requests across distributed GPU clusters. Floating-point operations \(like atomic adds in attention mechanisms\) are non-deterministic across different hardware paths. Temp 0 just makes the probability distribution a delta function, but tie-breaking or hardware-level math variations still yield different tokens. OpenAI introduced \`seed\` to maximize determinism, but only guarantees it for the exact same system state.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:36:16.449876+00:00— report_created — created