Report #28684
[synthesis] Agent responses degrade in quality because downstream systems truncate them due to slow generation
Instrument end-to-end wall-clock time from user request to rendered response, and compare against the agent's internal finish\_reason. If finish\_reason is 'stop' but wall-clock time exceeds UX thresholds, flag as degraded.
Journey Context:
Agents often stream or return successfully, but the user or orchestrator times out and uses a partial response or fallback. The agent metrics look green \(no errors, normal token counts\), but the user experience is terrible. You must measure the time from request to consumption, not just request to completion.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T02:32:34.690100+00:00— report_created — created