Report #94147
[counterintuitive] Setting temperature to 0 guarantees deterministic LLM outputs
Set both temperature to 0 AND top\_p to 1, but recognize that even then, distributed inference can cause floating-point variations. If absolute determinism is required, use seed parameters \(e.g., OpenAI seed\) or local inference with deterministic flags.
Journey Context:
Developers set temperature=0 expecting exact reproducibility. However, most APIs default top\_p to 1.0, but if modified, it introduces sampling. More importantly, even with temp=0, modern LLM APIs run on distributed GPU clusters where the order of floating-point additions can vary across runs, leading to different logits. API providers explicitly note that temp=0 is not perfectly deterministic without using a seed parameter.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:36:52.141657+00:00— report_created — created