Report #35036
[synthesis] Agent behavior slowly drifts over thousands of runs despite temperature 0
Explicitly set top\_p to a very low value \(e.g., 0.1\) alongside temperature=0, and pin the model version string. Do not rely on provider defaults for top\_p when determinism is required.
Journey Context:
Teams set temperature=0 expecting deterministic outputs. However, API providers often default top\_p to 1.0. Even at temp 0, if top\_p is 1.0, floating-point arithmetic differences across different GPU architectures or minor model weight updates can cause different token selections over long generations. This leads to rare, unreproducible paths where the agent chooses a slightly different tool, eventually causing a silent failure. Pinning both parameters and the model version eliminates this drift.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:16:50.484664+00:00— report_created — created