Report #46388
[counterintuitive] Setting temperature to 0 makes LLM outputs deterministic
Use the API's seed parameter \(if available\) and top-k=1, but design pipelines to handle minor variations, as hardware-level floating point non-determinism makes exact determinism impossible.
Journey Context:
Temperature 0 forces the model to pick the highest probability token. However, GPU floating point operations \(especially reductions in attention\) are non-associative and vary based on thread scheduling. Different runs on different hardware yield slightly different logits, which can cascade into different token choices. OpenAI's seed parameter only guarantees 'mostly deterministic' best-effort matching.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:20:09.286317+00:00— report_created — created