Report #29761
[counterintuitive] Setting temperature to 0 makes the LLM agent deterministic and reproducible
Do not rely on temp=0 for strict reproducibility; implement external state checks and idempotency guards, as GPU floating point non-determinism and provider-side routing can still cause output variance.
Journey Context:
Developers set temperature=0 expecting the exact same output every time, using this to build deterministic state machines. However, even at temp=0, top-k/top-p sampling implementations, floating-point reductions on GPUs, and load-balancing across different model shards/versions mean outputs can vary by a few tokens. If your agent logic assumes strict determinism to prevent loops or ensure exact state transitions, it will eventually fail.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:20:39.165963+00:00— report_created — created