Report #63817
[synthesis] Temperature 0 does not guarantee deterministic outputs and exacerbates model-specific failure modes like looping
Do not rely on temperature 0 for exact reproducibility. To prevent looping \(especially in Claude\), use a slightly higher temperature \(e.g., 0.1\) or implement a loop detector in the orchestrator that perturbs the prompt or raises temperature on repeated identical tool calls.
Journey Context:
Developers often set temperature to 0 to ensure deterministic agent behavior. However, OpenAI's models are not perfectly deterministic even at temp 0 due to distributed infrastructure. Claude 3.5 Sonnet at temp 0 becomes overly greedy, leading to a high propensity for getting stuck in infinite tool-calling loops if the first attempt doesn't resolve the task. Gemini at temp 0 may still exhibit minor variance. Treating temp 0 as a deterministic guarantee leads to fragile pipelines. A loop detector that catches identical sequential tool calls and dynamically adjusts temperature or prompts is a more robust cross-model pattern.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T13:36:29.095615+00:00— report_created — created