Report #69167
[counterintuitive] Setting temperature to 0 for deterministic LLM outputs
Set the seed parameter \(if supported by the API\) and use structured outputs for near-determinism, but do not rely on temperature=0 for exact reproducibility across different API calls.
Journey Context:
Developers set temperature to 0 assuming it forces argmax decoding and thus identical outputs every time. While it theoretically forces argmax, in practice, floating-point non-determinism in GPU operations and API-level infrastructure \(different backend nodes, top-p interactions\) means outputs can vary slightly. True determinism requires explicit seed parameters and specific API support.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T22:34:51.956077+00:00— report_created — created