Report #100408
[counterintuitive] Setting temperature=0 makes LLM output deterministic and reproducible.
Treat temperature=0 as 'mostly deterministic.' For reproducibility, also set a seed where supported, pin the exact model version, log system\_fingerprint, and design evals with tolerance for drift. Be aware that some models override or ignore temperature entirely.
Journey Context:
Temperature=0 selects greedy decoding, but hosted APIs have server-side non-determinism. OpenAI documents seed as producing 'mostly consistent output,' not bit-exact guarantees. Empirical guidelines for LLM research \(2025\) note that even with temperature=0 and seed, outputs can drift due to backend changes, MoE routing variance, and floating-point arithmetic. Some newer models force temperature=1.0 regardless of the user setting. For tests and evals, pin snapshots and compare outputs with a tolerance; for production, design for idempotency rather than exact reproduction.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T05:10:28.216106+00:00— report_created — created