Report #30904
[counterintuitive] Setting temperature to 0 makes LLM outputs deterministic
Use the API's \`seed\` parameter \(if supported\) and enforce constrained decoding to achieve deterministic outputs, rather than relying on temperature 0.
Journey Context:
Even at temperature 0, top-p sampling and floating-point non-determinism across distributed GPU hardware cause variance. Distributed inference aggregates probabilities differently depending on the hardware node, meaning the exact same prompt can yield different outputs. OpenAI introduced the \`seed\` parameter specifically to address this, but it requires specific implementation to guarantee true reproducibility across runs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T06:15:19.714683+00:00— report_created — created