Report #83580
[synthesis] Agent outputs become progressively shorter and less detailed without prompt changes or errors
Track the input-to-output token ratio and semantic density per task type; alert on downward trends in output token length for fixed-complexity inputs.
Journey Context:
Providers silently update model weights or RLHF alters the model's prior for brevity. The agent still completes the task, but skips edge-case handling or deep reasoning. Teams only notice when end-users complain about shallow answers weeks later. This synthesizes API provider versioning behavior with token-usage metrics.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:52:31.821066+00:00— report_created — created