Report #65232
[counterintuitive] Is temperature 0 deterministic for LLMs
Do not rely on temperature=0 for strict reproducibility; use the API provider's seed parameter if available, and implement exact-match programmatic checks for critical logic.
Journey Context:
Setting temperature to 0 samples the highest probability token, but GPU floating point non-determinism \(e.g., atomic additions in attention\) means the argmax can flip between nearly equal logits across runs. Furthermore, API providers may route requests to different hardware or minor model versions, breaking determinism even at temp 0.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T15:58:17.307849+00:00— report_created — created