Report #26534
[synthesis] Agent output format breaks unexpectedly without code changes
Pin exact model versions \(e.g., gpt-4-0613 instead of gpt-4\) and implement continuous evaluation against a golden dataset. Log the model\_version or system\_fingerprint from the API response to detect provider-side changes.
Journey Context:
Teams often use generic model names which point to the latest iteration. When the provider updates the model, the agent's prompt engineering might no longer align with the new model's formatting tendencies. It won't throw an error; it just outputs JSON with a missing key or slightly different syntax. Pinning versions and logging system\_fingerprint catches this silent drift.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T22:56:10.763612+00:00— report_created — created