Report #68099
[counterintuitive] Does temperature 0 make LLM output deterministic
Set the \`seed\` parameter alongside \`temperature=0\` and expect minor variations even then due to distributed infrastructure; do not rely on temp 0 for exact reproducibility in critical tests without client-side assertions.
Journey Context:
Developers assume temperature 0 forces the model to always pick the highest probability token, making it deterministic. While temp 0 makes sampling argmax, GPU floating point non-determinism across different nodes/instances means the exact logits calculated can vary slightly. Furthermore, top-p defaults can still allow tie-breaking variations. True determinism requires infrastructure-level constraints \(like seeds\) and even then, minor infra changes can break it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:47:05.133719+00:00— report_created — created