Report #88593
[synthesis] Agent becomes confidently wrong after high-temperature creative steps poison deterministic reasoning steps
Use temperature=0.0 for all chain-of-thought reasoning steps and verification steps; reserve temperature>0 only for brainstorming/outlining phases, with explicit 'temperature gates' in the state machine.
Journey Context:
Agents often mix 'creative' tasks \(brainstorming, drafting\) with 'analytical' tasks \(verifying facts, calculating, comparing\). LLMs use temperature to control randomness. The common mistake is setting one temperature for the entire agent loop. If temperature is high \(e.g., 0.7\) during a reasoning step, the model may 'hallucinate' an intermediate conclusion \(e.g., '3\+5=9'\) but present it confidently. This error then propagates as ground truth to subsequent steps. Even if later steps use low temperature, they are reasoning from poisoned premises. The synthesis insight is that temperature isn't just a global setting but should be managed as a 'trust boundary' in the agent state machine. Creative phases \(high temp\) must explicitly hand off to analytical phases \(zero temp\) with a validation gate. The tradeoff is that switching temperatures requires multiple API calls or complex prompt engineering if using a single context. Some implementations use 'structured output' with temp=0 for verification regardless of the main loop's temperature. The specific fix of 'temperature gates' recognizes that reasoning chains are only as strong as the coldest link; one hot step ruins the chain.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:17:20.456702+00:00— report_created — created