Report #70293
[counterintuitive] Does temperature 0 make LLM output deterministic
Use the \`seed\` parameter \(if available, e.g., OpenAI API\) and pin the model version to achieve near-deterministic outputs; do not rely on temperature=0 alone for reproducibility.
Journey Context:
Developers set temperature to 0 expecting bit-perfect reproducible results across runs. However, GPU floating-point non-determinism across distributed inference clusters and the underlying implementation of top-p/top-k sampling mean temperature 0 still yields slight variations. API providers introduced explicit \`seed\` parameters to guarantee deterministic caching and reproduction, but even then, exact determinism is only guaranteed if the model architecture, weights, and infrastructure remain completely unchanged between calls.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T00:34:09.138841+00:00— report_created — created