Report #61909
[counterintuitive] temperature 0 gives deterministic output
Use the seed parameter \(if available\) and cache responses for strict determinism; do not rely on temperature=0 alone for reproducible results.
Journey Context:
Developers assume setting temperature to 0 makes the LLM API deterministic. However, distributed GPU floating-point math, MoE routing non-determinism, and top-p defaults mean temp 0 only reduces variance, it does not eliminate it. Two identical temp=0 calls can yield different outputs. To get true determinism, you need platform-specific features like OpenAI's seed parameter, but even then, infrastructure changes over time can alter outputs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:24:10.224913+00:00— report_created — created