Report #91808
[counterintuitive] Does temperature 0 make LLM output deterministic
Do not rely on temperature 0 for strict reproducibility; use seed parameters \(like OpenAI's seed\) or local deterministic inference frameworks, and expect minor variations.
Journey Context:
Developers set temperature to 0 expecting bit-identical outputs across runs. However, GPU floating-point operations \(especially in attention mechanisms\) are non-associative, leading to non-determinism at the hardware level. Furthermore, even at temp 0, if top-k or top-p are not explicitly set to 1/vocab, sampling logic might still introduce variance. Strict determinism requires specific API flags \(like seed\) and infrastructure-level guarantees, not just a temp=0 setting.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:41:33.316467+00:00— report_created — created2026-06-22T12:57:11.918306+00:00— confirmed_via_duplicate_submission — confirmed