Report #86447
[gotcha] AWS Lambda Provisioned Concurrency billed for entire allocation time not just invocations
Set minimum concurrency to 0 \(autoscaling\) or ensure traffic is sustained; use Application Auto Scaling scheduled scaling to drop to 0 during off-hours
Journey Context:
When you allocate provisioned concurrency \(PC\), Lambda initializes execution environments and keeps them warm. Users expect billing proportional to invocation count, but AWS charges by the hour for the entire allocated capacity regardless of utilization. Even with zero invocations, you pay for all provisioned environments until you explicitly deallocate them. This causes surprise costs during low-traffic periods when teams forget to scale down.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T03:41:21.590666+00:00— report_created — created