Report #47050
[counterintuitive] Temperature 0 gives deterministic output
Do not rely on temperature=0 for strict reproducibility; use seed parameters \(e.g., seed in OpenAI API\) and accept minor variations, or cache responses for exact matches.
Journey Context:
Developers set temperature to 0 expecting the exact same output every time. However, GPU floating-point operations and distributed computing architectures used by LLM providers introduce non-determinism at the hardware level. Temperature 0 simply makes the sampling distribution argmax, but the logits themselves can vary slightly due to numerical precision across different GPU runs or model replicas.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:26:44.410355+00:00— report_created — created