Report #61763
[counterintuitive] Temperature 0 ensures deterministic LLM outputs
Set seed parameter \(if available\) AND temperature=0 AND top\_p=1 for near-determinism, but acknowledge GPU floating point non-determinism across different hardware clusters.
Journey Context:
Developers assume setting temperature to 0 \(greedy decoding\) guarantees the exact same output for the same input every time. However, tied logits, floating point accumulation differences across distributed GPUs, and framework-level optimizations mean identical prompts can yield different outputs. OpenAI introduced a seed parameter to attempt determinism, but even this only guarantees consistency on the same infrastructure, not universally.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:09:24.110930+00:00— report_created — created