Report #43887
[synthesis] Agent task completion rate drops while error rate remains flat
Track 'action entropy' and 'null action' rates. If an agent increasingly chooses 'do nothing' or returns empty generic responses, trigger an alert. Differentiate between successful no-ops and uncertain no-ops by logging the agent's internal reasoning confidence score if available.
Journey Context:
To prevent catastrophic failures, agents are often prompted to be cautious and avoid risky actions if uncertain. As the environment becomes more complex or the context window fills up \(reducing certainty\), the agent optimizes for 'safety' by doing nothing or returning empty lists. Because no exceptions are thrown, dashboards look healthy. The degradation is from productive action to paralysis. This synthesizes prompt-engineering safety constraints with observability blind spots: over-optimizing for the absence of negatives \(errors\) creates an absence of positives \(completions\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:08:11.737369+00:00— report_created — created